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RELATIVE RESISTANCE TO EXTINCTION OF ESCAPE 
TRAINING AND AVOIDANCE TRAINING! 


BY FRED D. SHEFFIELD AND HELENA WELLISZ TEMMER 
Yale University 


PuRPosE 


Hilgard and Marquis (4) distin- 
guished between ‘escape training’ and 
‘avoidance training.’ In escape train- 
ing, the animal learns a response which 
terminates a punishing stimulus; in 
avoidance training he learns a response 
which prevents onset of a punishing 
stimulus. Thus escape training is 
exemplified if on successive trials an 
animal is placed on a charged grill and 
learns to terminate the shock by run- 
ning to an uncharged platform; avoid- 
ance training is exemplified if in the 
same set-up a short time-interval is 
always allowed during which the ani- 
mal can prevent the shock by getting 
to the safe platform before the grill is 
charged. The present experiment 
compared the strength of conditioning 
resulting from the two procedures 
during acquisition and during extinc- 
tion. The purpose was to test an 
implication—based on conditioning 
theory—that the escape procedure 
would produce a more vigorous and 
dependable response during acquisi- 


1The authors are grateful to Dr. Neal E. 
Miller for the loan of the apparatus and labora- 
tory facilities used and for helpful suggestions in 
the preparation of the present manuscript. 





tion but that the avoidance procedure 
would produce greater resistance to 
extinction. 

THEORY 


It is assumed that responses instru- 
mental in removing pain or fear will be- 
come conditioned to the concomitant cue 
patterns. It is also assumed that com- 
plete avoidance of pain has no reinforc- 
ing effect in itself, rather it leads to ex- 
tinction both of conditioned fear and of 
the response instrumental in avoiding 
pain. This differs from the assumption 
by Schlosberg (7) and by Hilgard and 
Marquis (4) that avoidance of a noxious 
stimulus strengthens the instrumental 
response, an assumption previously 
shown (8) to be unnecessary and contrary 
to fact. 

A further assumption is that the vigor 
of an instrumental response can, to a 
large extent, vary independently of the 
certainty that it will be performed. Thus 
amplitude and frequency are not treated 
as merely alternative indexes of strength 
of conditioning but are assumed to de- 
pend on the conditions of training. 
Vigor depends primarily on the motiva- 
tion present during training; certainty 
depends primarily on the amount of 
practice at performing the response to the 
cue patterns in use. Average vigor is a 
resultant of both certainty and the mo- 
tivation present during training. 
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A final assumption is that learning is 
fairly specific to cue patterns present at 
the time the instrumental response is 
performed. This follows Guthrie’s (3) 
emphasis on the variability of the total 
cue pattern in most learning situations 
and the need to connect the response to 
all of the patterns likely to occur when 
strength of conditioning is tested. Pres- 
entation of a novel pattern results in a 
generalization decrement in certainty of 
response, which may also show up as a 
decrement in vigor, especially if the re- 
sponse is competing with other responses 
to distracting cues. 

With these assumptions in mind it is 
expected that the acquisition curve for 
escape training will rise to a maximum of 
certainty and vigor, the final stable 
level of vigor being determined chiefly 
by the severity of the painful stimulus. 
Motivation based on pain and innate fear 
is fairly constant in escape training—if 
anything it increases somewhat because 
of acquisition of conditioned anticipatory 
fear to the cues present at the outset of a 
trial. With avoidance training, on the 
other hand, it is expected that while the 
acquisition of certainty will rise to a 
maximum value, the curve for vigor will 
first rise and then decline to a minimal 
value. This follows because while moti- 
vation is based on both pain and innate 
fear on early trials, the increase in cer- 
tainty of response as training continues 
leads to more and more frequent omis- 
sions of pain, leaving conditioned fear 
as the chief motivation for the response 
after extended training. Moreover, the 
frequent successive omissions of the pain- 
ful stimulus as certainty increases should 
result in extinction of conditioned fear, 
reducing motivation to a level at which 
vigor is just strong enough to avoid the 
pain most of the time. Vigor cannot 
consistently fall below such a level be- 
cause motivation (conditioned fear) is 
maintained by the automatic reintroduc- 
tion of pain whenever vigor falls to too 
low a level. A final stable level of vigor 
is expected, in which the weakening 
effects on motivation, produced by omis- 
sion of pain, are exactly offset by the 
occasional reintroduction of pain. The 
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final instrumental response is one of 
minimal vigor and maximal certainty. 

These inferences apply only during 
continued training. When extinction by 
omission of all shock begins, the similar- 
ity or dissimilarity of the cue patterns 
present during training and extinction 
looms in importance. Practically all 
forms of experimental extinction involve 
some changes in the cue patterns present 
during training. But the changes are 
especially striking when extinction is 
achieved by the omission of the painful 
stimulus. Escape trained animals have 
practiced the escape response only to cue 
patterns in which dominant features are 
the presence of pain and resultant innate 
fear, tenseness of muscles, vigorous 
activity, and so forth. The first extinc- 
tion trial without pain provides a marked 
change in such cues and should produce 
an immediate loss in certainty of response 
because of the generalization decrement. 
Failure of response brings still more novel 
cue patterns—sitting down, starting to 
groom, exploration, crouching, and so 
forth. These provide cue patterns never 
permitted during training and therefore 
never connected either to the escape 
response or to the fear reaction. 

By contrast the very cues most likely 
to result in failure to respond in escape 
animals are most likely to evoke the 
avoidant response in avoidance trained 
animals. After extended training they 
have had ample practice at performing 
the instrumental response without the 
presence of pain as part of the cue pat- 
tern. Also they have been through a 
number of extinction series and have had 
the cue patterns associated with sitting 
down, starting to groom, etc., followed by 
reintroduction of pain and the reinforce- 
ment both of conditioned fear and the 
instrumental response. In fact, such 
cue patterns—those correlated with fail- 
ure to respond-—are the only ones that 
get reinforced 100 percent of the time in 
avoidance training. Failure to respond 
in time is the only condition regularly 
reinforced, and all of the cue patterns 
signifying such failure are conditioned to 
the response in the course of avoidance 
training. In effect the animal is given 





EXTINCTION OF ESCAPE AND AVOIDANCE 


specific training in being resistant to 
extinction. 


Metuop 


The escape and avoidance proce- 
dures used in the experiment differed 
as little as possible except with re- 
spect to the avoidable or unavoidable 
nature of the noxious stimulation. 
The only difference in procedure was 
the difference in time of onset of 
punishment on each trial; it was nec- 
essary to provide at least a brief time 


interval for the avoidant response to 
be made: 


1. In the escape procedure a rat was 
placed on a charged grill and allowed 
to terminate shock by getting onto an 
uncharged adjacent floor. 

2. The avoidance procedure was the 
same as the escape procedure except 
that shock started one-and-one-half 
seconds after the animal landed on the 
grill. 


Both procedures involved escape at 
the outset of training—objectively 
stated, the two procedures differed 
only in whether the shock was immedi- 
ate or slightly delayed. But when- 
ever avoidance animals ran off the 
grill in time, they avoided shock 
completely. 


Details of apparatus and procedure 


The apparatus consisted of an enclosed run- 
way 54 in. long, half the length of which had a 
grill floor for administering shock and the other 
half of which had smooth flooring. The two 
halves were different in color of ceilings and walls, 
the grill compartment being white and the safe 
compartment black except for a common glass 
wall for purposes of observation. The floors of 
each half were hinged and equipped with 
switches for operating a chronoscope to deter- 
mine response-time between landing on the grill 
and arriving in the safe compartment. Between 
the two halves was a sliding door used to prevent 
retracing. 

A telechronmotor driven timer, energized 
when the animal’s weight depressed the grill, 
controlled the time interval between landing on 
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the grill and delivery of shock for avoidance 
training. The shock was taken from a radio 
transformer: 90 volts was used, and the shock 
was delivered to the animal through a 250,000- 
ohm resister in series with the grill. This is a 
moderately weak shock in terms of activity 
produced in mature rats. 

The subjects were 20 male albino rats. The 
animals had, about a month previously, served 
in an experiment using food reward; they had no 
previous experience with shock. The 20 animals 
were grouped into 10 pairs such that animals in 
a pair differed by no more than 10 gm. in weight. 
The range in weight was from 250 to 385 gm. 
One purpose of the pairing was that the same 
shock setting was used for all animals, making 
the punishing effects of shock partly dependent 
on weight. 

At the outset of each experimental session a 
coin was tossed to determine which rat in a pair 
would get escape training and which would get 
avoidance training. The time between trials 
was controlled by watching the hand of a stop- 
watch. Every 60 sec. a rat was placed on the 
grill, and since the members of a pair were run 
in alternating trials, each rat received a trial 
every two min. It is apparent that the proce- 
dure of running animals in pairs controlled for 
any possible variation in manner of handling the 
animals during the course of the experiment. 

Partly to enable maintaining the constant 
timing of trials and partly to reduce variation in 
the duration of shocks in early trials, the animals 
were gently pushed off the grill with a large 
paddle if they failed to get off within 20 sec. 
In putting an animal on the grill care was taken 
to handle the animal in a constant manner, and 
an attempt was made always to place him in 
exactly the same position, pointed toward the 
safe compartment, with his hind end about an 
inch from the rear wall of the grill compartment. 
Once the animal got into the safe compartment, 
retracing was prevented by closing the interven- 
ing door. 

During training, escape animals always landed 
on a charged grill, whereas avoidance animals 
landed on an uncharged grill which was auto- 
matically charged one and one-half seconds later. 
Sounds from the timer controlling this interval 
were muffled by enclosing the timer in a sound- 
proofed box, but as an additional precaution 
against cue differences, the timer was kept in 
operation (nonfunctionally) with escape animals 
and throughout extinction. 

Training was continued for 40 trials, at which 
time the shock circuit was turned off and extinc- 
tion trials began without interruption. Aside 
from turning off all shock, the only change in 
procedure during extinction was that the animal 
was removed from the apparatus if he failed to 
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get off the grill within 10.0 sec. All animals 
received 10 extinction trials, after which an 
animal was discontinued if he stayed on the grill 
for more than 10 sec. on each of two successive 
trials. 

As a practical limitation no animal was con- 
tinued for more than 60 extinction trials (two 
hours). 

The one measurement taken on all trials was 
the time between landing on the grill and step- 
ping onto the ‘safe’ floor. This measurement 
was read to the nearest 0.1 sec. from the chron- 
oscope that was started and stopped by the 
switches on the two respective hinged floors. 


RESULTS 


The course of learning and extinc- 
tion in the two groups are depicted in 
Fig. 1. The curves use an amplitude 
measure: speed of locomotion in feet 
per sec. (as computed from the time 
measure and the constant distance 
traversed). Partly because the 20- 
sec. limit during training and the 
10-sec. limit during extinction made 
means indeterminate, the basic meas- 


ure of central tendency used in Fig. 1 


is the median. ‘To smooth the curves 
and at the same time reveal the im- 
portant changes, the trial to trial 


medians are averaged by successive 
pairs during the first 10 trials of 
training, by successive tens during 
the rest of training, and again by pairs 
during the first 10 extinction trials. 
Fig. 1 exhibits three important and 
statistically reliable trends: 


1. A greater amplitude of condi- 
tioned running in the escape group 
during training, especially in later 
training trials. 

2. A gradual weakening in the 
amplitude of the initial avoidance 
responses during later portions of 
avoidance training. 

3. A greater amplitude of condi- 
tioned running in the avoidance group 
during extinction. 


In evaluating the reliability of the 
trends shown in Fig. 1, ordinary t-tests 
on mean time or mean speed were not 
used because of the indeterminate 
means. The alternative statistical 
tests used are indicated in each of the 
following sections devoted to the 
statistical evaluation of the trends 
depicted. 
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Strength of instrumental locomotion in rats trained by the escape compared with the 
avoidance procedure, during training and during extinction. 
in feet per sec., averaged over the indicated trials. 


The measure plotted is median speed 
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TABLE I 


Mean Numer or Responses wits Response-Trues or 0.7 Sec. on Less 
Durinc THe 40 Tratninc TRIALs 











Animal Pair Number 





6 





Escape 
Avoidance 


5 
° 





Difference 

















5 























t = 1.8, df = 9, P = .05 (one tail) 


Difference between escape and avoid- 
ance during training 


One way of dealing with time meas- 
ures that have an indeterminate mean 
is to set a time criterion of what con- 
stitutes a response, give each subject 
a score according to the number of 
such responses in a fixed number of 
trials, and perform a ¢-test on these 
frequency scores. This procedure 
was adopted in the present case. The 
time value chosen was 0.7 sec. (cor- 
responding to an amplitude of about 
3-1 feet per sec.). This criterion was 
used because, to the nearest 0.1 sec., 
0.7 was the median response-time for 
all 800 responses during training. 
Using the median as the criterion pro- 
vides a meaningful way of distinguish- 
ing ‘more vigorous’ and ‘less vigorous’ 
responses; it also has the advantage 
of placing the mean number of re- 


sponses close to the center of the 
range, making the correlation between 
mean and sigma with enumerative 
data relatively unimportant in its 
effect on the validity of the ¢-test. 
Table I shows the results for all 40 
training trials. In this case the range 
of scores is from 0 to 40 and the mean 
is close to 20. 

The difference is in the expected 
direction but is not highly reliable; 
differences as large would occur in 
this direction by chance about five 
times in 100. The difference in per- 
formance is more reliable if one con- 
siders only the later phases of training 
—-in which the effect of shock omission 
in avoidance training is most strongly 
felt. The results for the last ten 
training trials are shown in Table II. 
Here the same criterion is used inas- 
much as 0.7 sec. was also the median 


TABLE II 


Mean Number or Responses with Response-Times or 0.7 Sec. or Less 





Durinc THe Last 1o Traininc TRIALS 








Anima! Pair Number 








Escape 
Avoidance 


10 








Difference 9 











| 




















t = 2.7, df = 9, P = .o1 (one tail) 
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TABLE Ill 


Mean Number or Responses wita Response-Times or 0.7 Sec. on Less 
Amonc Avormance Animats Durinc Tramninc TRIALS 11-20 AND 31-40 








Avoidance Animal Number 





4 s 


6 





Trials 11-20 5 9 
Trials 31-40 r 1 I 


° 








| 


Difference 4 | 8 











| 
= | 
| 


| 
;} oO 
! 





| 
| 





t= 2.3, df = 9, 


response-time. As seen in Table II, 
the mean difference for the last 10 
training trials is in the expected di- 
rection, is relatively large, and is 
quite reliable. 


Weakening of avoidance response in 
later training 


As expected from the theoretical 
analysis, the vigor of the response 
among avoidance animals was seen in 
Fig. 1 to decline during the later 
portions of the training. This change 
in performance was evaluated for 
statistical significance by a compari- 


10 


9 
AVOIDANCE 


-_——" 
eo 


MEAN NUMBER OF RESPONSES 





iL 


= .02 (one tail) 


son of the last 10 trials with the second 
10 trials among these animals. The 
same criterion of a ‘response’ was 
used as in Table II, i.e., a response- 
time of 0.7 sec. or less. This was 
somewhat faster than the median but 
permitted comparability with Tables 
I and II. The results are shown in 
Table III. Here each avoidance ani- 
mal is paired with himself, his per- 
formance on trials 11-20 being com- 
pared with his performance on trials 
31-40. It can be seen that the ex- 
pected decline in vigor is reliable at 
the two percent confidence level. 


ego epee ooo 
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Fic. 2.—Mean number of conditioned runs during the first 10 extinction trials, shown as a function 
of the response-time used as a criterion of whether or not a conditioned run occurred 
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TABLE IV 


Number or Responses with Response-Times or 3.0 Sec. orn Less 
Durinec THe First to Extinction Triats 








Animal Pair Number 





a4 








Avoidance 8 
Escape 2 








Difference > | 6 











= 




















t = 2.7, df = 9, P = .o1 (one tail) 


Difference between escape and avoid- 
ance during extinction 


Fig. 2 shows complete results for 
mean number of ‘responses’ during 
extinction as a function of the cri- 
terion of a ‘response’ selected. Pres- 
entation of complete curves is desir- 
able, since it is to be expected that 
there will be a superiority of escape 
training for very fast responses, no 


difference for very slow responses, and 
a superiority of avoidance training at 
some empirically determined inter- 
mediate value. 

It can be seen in Fig. 2 that the 
mean score of the avoidance animals 


during extinction is considerably 
larger than that for escape animals for 
all response-times except the very 
fastest. The reversal of the two 
curves at the fast response-times re- 
flects the expected result that at the 
end of training the performance of the 
escape animals is enough more vigor- 
ous than that of avoidance animals 
for the majority of the relatively few 
instances of very fast response-times 
to come from the escape animals. 
The two curves would presumably 
converge again at 100 percent if a 
sufficiently long time criterion were 
used. 

The probability values shown in 
Fig. 2 for the differences between 
means were computed from the dis- 


tributions of 10 differences between 
scores of individual members of the 
10 pairs in a manner analogous to that 
used in Tables I, II and III. The 
result at the three-sec. criterion of 
response-time, which was approxi- 
mately the point with the greatest 
t-value, is presented in Table IV.* 

The results on extinction in Fig. 2 
and in Table IV compare perform- 
ances over the first 10 extinction trials, 
during which trials a time measure was 
obtained from every animal. An- 
other way of comparing the resistance 
to extinction in the two groups is in 
terms of trials to reach a criterion of 
extinction. This is shown in Table 
V, which gives the number of trials to 
reach the criterion of two successive 
response-times of 10 sec. each. Two 
avoidance animals did not reach this 
criterion in the practical limit of 60 
extinction trials, so medians are shown 
rather than means in Table V. Since 
most of the avoidance animals did not 
receive their last shock on the last 
training trial, the number of unrein- 
forced trials to reach the extinction 
criterion is also shown (in parenthesis) 
for avoidance animals. 


* Because the validity of the t-test might be 
questioned with the data in Tables II, III, and 
IV, Fisher’s (2, pp. 43-47) size and sign test ‘of a 
wider hypothesis’ was also employed. In each 
case the P-value obtained with the t-test was 
increased by only .o1. 
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Since two avoidance animals have 
indeterminate scores, the mean differ- 
ence test cannot be used on trials to 
reach the extinction criterion. Nor 
would it be particularly appropriate 
with such large qualitative differences 
in performance. To evaluate the 


‘statistical significance of the difference 


in performance in trials to reach the 
criterion the following procedure was 
therefore employed. Inspection of 
Table V_ reveals that the median, 
counting from the 4oth trial, for the 
entire group of 20 animals is 17.5. 
On the null hypothesis it is expected 
that half of the escape animals would 
fall above and half would fall below 
this value and similarly half of the 
avoidance animals should fall above 
and half below this value. In con- 
trast with this chance expectation, 
eight of the ten escape animals were 
below 17.5 and eight of the ten avoid- 
ance animals were above this value. 
Using chi-squared and the Yates (10) 
correction for continuity, the prob- 
ability of obtaining a discrepancy as 
large or larger in the same direction as 
the obtained one is .o12, which indi- 
cates a high degree of reliability for 
the difference in performance. It 
should be borne in mind that this test 
is based on the results beginning with 
the first extinction trial after the 40 
training trials. In all but two avoid- 
ance animals this was merely a con- 
tinuation of a series of non-reinforce- 
ments that had started before the last 


training trial. If, instead, one counts 
from the last reinforcement, the differ- 
ence can be seen from Table V to be 
much greater and is correspondingly 
more reliable. To take an extreme 
case the avoidance animal in pair No. 
10 received his last reinforcement on 
the sixth training trial, so he had re- 
sponded without reinforcement for 
34 consecutive trials prior to the 22 
consecutive responses during ‘extinc- 
tion.” Thus, he actually gave 56 
successive unreinforced responses be- 
fore extinguishing, which was more 
than his running mate’s 51. The 
results are made even more impressive 
by computing the number of unrein- 
forced responses obtained per rein- 
forcement, which was over 22 times 
as great for the 10 avoidance animals 
if one uses the 1o-sec. criterion and 
both training and extinction trials. 


Relative ‘drive’ value of immediate and 
delayed shock 


A methodological point that seems 
worth noting is that delayed shock 
appeared to produce more activity 
than immediate shock. In the curves 
shown in Fig. 1, there is the appear- 
ance of almost exact equating of the 
two groups at the outset of training, 
with only a tiny advantage to the 
shock which was delayed for 1.5 sec. 
However, the results in Fig. 1 are ex- 
pressed in median speed, which tends 
to discount any differences involving 
the large time measures characteristic 


TABLE V 


Triats To Reacu THe Crirerion or Extinction 











Animal Pair Number 








Avoidance: 
After goth trial 
(After last reinf.) 
Escape 


| 60+ 18 | 27 








6} 15 
| ie 


21 
(74+) (22) | (27) (30) 
10 


| 
| 12 | 22 

(15) | (56) 
13 2 | 51 


|} 28 | 2 60+ | 30 
(37) | (2) | (66+)) G3) 
25 3, 17 | 
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of the outset of training. In terms of 
time rather than speed the median 
averaged about six sec. for the first 
two trials of avoidance but was over 
10 sec. for escape. Here the critical 
difference is not avoidability or un- 
avoidability but merely the delay of 
1.5 sec. in the administration of shock 
in the avoidance group; at this stage 
of training both groups were learning 
to escape shock. The evidence sug- 
gested that delayed shock was a better 
stimulus for trial and error activity 
than immediate shock. This is per- 
haps best shown by comparing the 
two conditions according to the num- 
ber of times the animals had to be 
pushed off the grill because of failure 
to get off on their own within the 20 
second limit allowed during training. 
The mean was 5.0 for immediate 
shock and only o.7 for delayed shock. 
However, only four of the ten pairs 
were chiefly responsible for this rela- 
tively large difference and it was not 
very reliable. Using the t-test, the 
chances were about six in 100 of get- 
ting as large a difference in either di- 
rection by chance. If the phenom- 
enon actually exists, it appears to be 
of an ‘all-or-none’ character that 
would require more pairs to be placed 
outside the range of chance. The re- 
sults suggest but do not prove that 
‘freezing’ is more likely to be evoked 
when the animal is dropped on an 
already charged grill than when the 
shock is slightly delayed. 


Discussion 


The results conform in all major 
respects to the theoretical analysis. 
Escape training produced a response 
that was more vigorous at the end of 
training but that extinguished more 
rapidly than the habit produced by 
avoidance training. Further, avoid- 
ance training showed the expected de- 
cline in vigor as training continued. 
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This is comparable to a previously 
reported finding (8) that as avoid- 
ance training—and certainty of a 
minimal response—increased, the am- 
plitude of the avoidant response 
decreased. 

The results help to clarify the tind- 
ings in an experiment by Brogden, 
Lipman, and Culler (1) in which 
avoidance training led to better per- 
formance during acquisition than 100 
percent reinforcement with the same 
noxious stimulus. Their 100 percent 
‘reinforcement’ procedure used a pain 
stimulus of a fixed, brief duration re- 
gardless of the behavior of the animal 
and did not guarantee performance of 
the ‘unconditioned’ response, as shown 
elsewhere (8). In the present experi- 
ment, in which escape insured that the 
correct response was performed on 
almost every trial, the avoidance pro- 
cedure produced poorer performance 
during training than did the compar- 
able 100-percent reinforcement pro- 
cedure.® 

The results are contrary to the 
interpretation offered by Mowrer and 
Lamoreaux (6) that escape from fear 
is more reinforcing than escape from 
both fear and shock. The reversal, 
obtained in the present experiment, in 
performance during acquisition and 
extinction is inconsistent with the 
position of Mowrer and Lamoreaux, 
although readily explained by taking 


* Non-interference of shock with the running 
response was not completely achieved. Animals 
in the escape group occasionally ‘froze’ on the 
grill after a long series of successful escape re- 
sponses, and of the total group of 20 animals the 
two giving the poorest performance during train- 
ing were both escape animals. Also when omis- 
sion of shock weakened the response during 
training of avoidance animals they often got 
shocks just as they crossed the threshold of the 
safe compartment, which appeared to engender 
some hesitation at this point. It seems likely 
that both factors could be reduced or eliminated 
by using a shorter grill or a jumping response 
with a small grill platform. 
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into account the effect of the avoid- 
ance procedure in conditioning the 
response to non-reinforcement cues. 
Mowrer and Lamoreaux based their 
interpretation in part on the incorrect 
assumption that Brogden, Lipman, 
and Culler used the escape procedure 
in their 100-percent reinforcement 
group; actually the shock was of a 
fixed duration of 0.1 sec. (private 
communication from Dr. Brogden). 

The reversal in performance of the 
two groups when acquisition and ex- 
tinction are compared also demon- 
strates that amplitude during training 
and extinction do not simply reflect 
‘strength’ of conditioning; both must 
be interpreted in terms of the nature 
of the cues to which the response is 
connected. The results fit the inter- 
pretation that the avoidance response, 
while weaker in vigor than the escape 
response at the end of training, is more 
resistant to extinction because it has 
been connected during training to the 
novel cue patterns concomitant with 
omission of reinforcement. 

The relative performance during 
training and the relative resistance to 
extinction of the escape and avoid- 
ance procedures would be expected to 
vary as a function of several factors 
not studied in the present experiment. 
One such factor is the number of train- 
ing trials. With brief training the 
two procedures should be maximally 
similar in effects both during training 
and during extinction, whereas the 
longer the training the more they 
should diverge in the directions found 
in the present experiment. 

Another factor is the time interval 
by which the noxious stimulus is de- 
layed in avoidance training. Long 
time intervals involve delayed condi- 
tioning and would slow down acquisi- 
tion under avoidance training and pro- 
duce minimal vigor of response; 
however, they might lead to ultimately 


greater resistance to extinctionbecause 
the long hesitations characteristic of 
late phases of extinction would be cue 
patterns reinforced during training. 
Shorter delays would speed acquisition 
and make for greater vigor of the 
avoidance response at the end of 
training since cues from very brief 
hesitation would become conditioned. 
For example, starting to groom would 
provide the cue pattern for running 
with brief delays whereas more pro- 
tracted grooming would provide the 
cue pattern with long delays. 

A third factor is the intensity of the 
painful stimulus used. If severe pain 
and only a few training trials are used, 
both escape and avoidance training 
may show a continued increase in cer- 
tainty of response rather than typical 
extinction when shock is eliminated. 
Miller (5) has shown that fear alone 
can be used as a basis for instrumental 
learning, and if conditioned anxiety 
is maximized by intense pain, both 
groups might show parallel vigor and 
increasing certainty during initial 
phases of ‘extinction.’ Thus, with 
brief training intense pain might ob- 
scure the difference between the two 
procedures, although sufficient train- 
ing should ultimately reveal the cue 
differences even with intense pain. 

It should be pointed out that 
basically the present stiraulus-response 
analysis is not restricted to a com- 
parison of escape and avoidance train- 
ing but is generally applicable to any 
comparison of 100-percent reinforce- 
ment and partial reinforcement. A 
similar analysis has been successful 
(9) in predicting effects of 50-percent 
random reinforcement with food as 
reward. The central concept is that 
all partial reinforcement procedures 
connect the response to cue patterns 
characteristic of omission of reinforce- 
ment, cue patterns which normally 
contribute—through _ generalization 
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decrement—to loss of response during 
extinction after 100-percent reinforce- 
ment. Any procedure which intro- 


duces some omission of reinforcement 
during acquisition should tend to 
weaken the response during training 
but prolong extinction of the response. 


The foregoing interpretation of avoid- 
ance training should be contrasted with 
a possible ‘cognitive’ interpretation to 
the effect that what the animal learns is 
an ‘expectation’ that pain will come if he 
does not respond in a particular way 
within a particular time limit. With a 
few additional assumptions it may be 
predicted—in line with the facts—that 
the ultimate avoidance response would 
involve minimal expenditure of energy, 
and would be very resistant to extinction 
because the expectation learned prevents 
the animal from discovering that the 
experimenter has arbitrarily decided to 
turn off the shocking device. Such an 
interpretation is meaningful to laymen 
and parsimonious in words used to pre- 
sent it. 

However, it is believed that such an 
‘explanation’ is really just a statement of 
the cause and effect relations that exist 
in the environment, together with the 
circular assumption that animals avoid 
anticipated noxious effects. Humans 
often thus verbalize the observed course 
of events in the environment and there- 
fore can get extra practice (symbolically) 
of the same behavior they would perform 
under such circumstances and would 
eventually acquire even without words. 
But it is believed that the attribution of 
verbal symbols—or equivalent symbolic 
activity—to animals at the level of the 
rat is anthropomorphic. The _ rat’s 
‘knowledge’ of the environmental facts 
of avoidance training is probably re- 
stricted to an impulse to give the instru- 
mental response when the cues preceding 
shock are presented, an impulse which 
gets stronger when cues for loitering are 
added to the pattern. The stronger 
impulse is based on the fact that cues of 
loitering have invariably been followed 
by pain and fear and the response which 
terminates these. It seems likely that 
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much of human avoidance learning is at 
the same non-cognitive level. 


SUMMARY 


1. One group of animals (escape 
group) was trained to run off a grill by 
placing them on a charged grill and 
allowing escape to an adjacent safe 
compartment; the other group (avoid- 
ance group) was treated in the same 
way except that an interval of 1.5 sec. 
was allowed for them to get off the 
grill prior to shock. 

2. Animals in the avoidance group 
were: poorer in performance at the end 
of training than escape animals; 
poorer in performance at the end of 
training than their own performance 
earlier in training; more resistant to 
extinction than escape animals. 

3. The findings support the hy- 
pothesis that part of typical extinction 
is due to the alteration of the condi- 
tioned stimulus pattern when the con- 
ditions are shifted from those of rein- 
forcement to those of non-reinforce- 
ment and that avoidance training 
offsets this factor by conditioning the 
instrumental response to cues char- 
acteristic to a series of non-reinforced 
trials. 


(Manuscript received May 4, 1949) 
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AN EXPERIMENTAL TEST OF THE SELECTIVE 
PRINCIPLE OF ASSOCIATION OF 
DRIVE STIMULI * 

BY HOWARD H. KENDLER 
New York University 
AND 
FLORENCE E. LAW 
University of Colorado 


Hull (2) and Leeper (9) have reported 
rats are capable of acquiring differential 
habits based solely on internal conditions. 
Using single choice point mazes, these 
investigators discovered that their ani- 
mals learned to turn in one direction 
when hungry and in another direction 
when thirsty. Hull (3) has attempted to 
explain this phenomenon by assuming 
the animals learned to respond to two 
different patterns of stimuli, i.c., they 
learned one turning habit to the stimulus 
compound consisting of the choice point 
and Aunger drive stimulus and a different 
turning habit to the choice point and 
thirst drive stimulus compound. 

Kendler (6) tested this interpretation 
by performing a similar experiment in 
which the animals were motivated simul- 
taneously by doth hunger and thirst dur- 
ing the training series. Strict adherence 
to Hull’s analysis with no other factors 
considered would have led one to expect 
failure of such discrimination behavior 
on test trials with single motivation, for 
the reason that there was only a single 
stimulus pattern to which both spatial 
responses became conditioned. Since 
both the hunger and thirst drive stimulus 
components were present in this stimulus 
pattern, each would, according to this 
line of reasoning, become associated with 
both spatial responses. The fact that 
the Ss, when motivated either by hunger 
or thirst, during the test series evidenced 


* The first study reported in this paper was 
conducted by the senior author in the Psychologi- 
cal Laboratory of the State University of Iowa. 
The second study was completed at the Univers- 
ity of Colorado. It was one of several studies 
contained in the junior author’s master thesis (8). 


appropriate discrimination behavior (i.e. 
went to water when thirsty; to food, when 
hungry) suggested that the explanation 
offered by Hull was inadequate. 

It has been assumed by most associ- 
ation theorists, including Hull, that a// 
stimulus components impinging upon 
the sensorium at the time of a response 
become associated with that response 
(assuming, of course, that the necessary 
conditions of association formation are 
met). Kendler (6) suggested an alter- 
native assumption which involved a 
selective principle of association of drive 
stimuli. The specific hypothesis offered 
was “that only those drive stimuli which 
are themselves reduced become connected 
to a rewarded response.” His experi- 
mental results tended to support this 
new hypothesis, because it would be 
assumed that when the animal, both 
hungry and thirsty, had turned left to 
attain food, only the hunger drive stimu- 
lus became connected to that response. 
In a similar manner when the hungry 
and thirsty animal had turned right to 
water, only the thirst drive stimulus 
would be expected to become connected 
to that turning response. Appropriate 
discriminatory behavior during the test 
series, consequently, would be antici- 
pated. 


The present paper reports the re- 
sults of two studies which were de- 
signed to test further the adequacy of 
the selective principle of association 


of drive stimuli. The experimental 
designs were rather simple. In the 
first study two groups of albino rats 
were trained to go to that side of a 
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T-maze which contained food. One 
group (Group H&T) was hungry and 
thirsty while the other group (Group 
H) was just hungry. After reaching 
the criterion of learning to go to food, 
both groups were made thirsty and 
food was placed in the end-box which 
had previously been empty. The test 
was to determine which group would 
learn to go to water more quickly. 
According to the non-selective princi- 
ple of association of drive stimuli, the 
hungry and thirsty group should be 
retarded in learning to go to water 
because the thirst drive stimulus 
should have a tendency to evoke the 
response to the food side. However, 


according to the selective principle, no 
retardation should be expected for the 
reason that the thirst drive would not 
have become connected to the re- 
sponse leading to food. The second 
study was essentially a duplicate of 
the first except that the subjects ini- 


tially were trained to go to the side of 
the T-maze containing water. Dur- 
ing this time one group was motivated 
by thirst and hunger (Group T&H) 
and the other just by thirst (Group T). 
Following the learning of this problem 
the motivation of both groups was 
shifted to hunger. As in the first 
investigation, the expectation, ac- 
cording to the selective principle, 
would be that there should be no 
essential difference between the groups 
in learning to go to food. The non- 
selective principle would generate the 
deduction that the thirst and hunger 
group would be retarded. 


EXPERIMENTAL ProcepuRES 
First Study 


The Ss were 32 male albino rats from the 
colony maintained by the Department of Psy- 
chology of the State University of lowa. Their 
ages ranged from 106 to 138 days and they all 
had previous experience in a Skinner box. 
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First problem.—A single choice T-maze was 
used. A complete description of it is presented 
in a previous paper (3). The Ss were tested for 
position habit by allowing them three free 
choices. Three animals that chose the same side 
for all three trials were discarded. The rats were 
divided into two groups of 16 Ss; each group 
having different motivating conditions during 
the learning trials. The division was based on 
the records of the test trials for position habit. 
Group H (hungry) was deprived of food for 22 
hours while Group H&T (hungry and thirsty) 
was deprived of both food and water for the same 
period of time. Food, in the form of 10 large 
pellets of Purina Dog Chow, was present in one 
of the goal boxes, while the other remained 
empty. Food was always placed in the goal box 
on the side which the animal chose only once 
during the three non-rewarded free choice trials. 

The specific details of the feeding schedule for 
Group H was as follows; the animals were given 
one and a half pellets of dog chow (nine gm.) 
approximately 15 min. after its experimental 
session. The time from complete consumption 
of this food to the beginning of the next day’s 
trial was approximately 22 hours. The water 
bottles for Group H were filled and always 
available. Group H&T had the same regimen 
except that their water bottle was removed after 
finishing one pellet of food. The remaining half 
pellet was then given. This group (H&T) there- 
fore had approximately 22 hours of deprivation 
of both food and water. 

Each S was given three trials per day until 
the criterion of learning, four days of correct 
trials (12 successive correct trials) was attained. 
When the animals chose the correct side they 
were allowed to eat for 20 sec.; when they chose 
the wrong side they were kept in the empty end- 
box for an equivalent period. At least 15 min. 
elapsed between successive trials of the same S. 

Second problem.—Following the attainment 
of the criterion of learning to go to the food, the 
animals of both groups were made thirsty and 
trained to go to the side opposite to the one in 
which they received food. The thirst drive was 
manipulated by permitting the animals access to 
water and food for one-half hour. This feeding 
and drinking period started about 15 min. after 
the last criterion trial. At the end of 30 min. 
the water was removed and a large amount of 
food was inserted. Approximately one-half 
hour before the trials of the following day dishes 
of kibbled meal were introduced to induce further 
eating. By such a method the Ss in both groups 
were motivated by approximately 22 hours of 
water deprivation and were satiated for food. 
Water was available in that end-box which was 
empty during the first part of the investigation. 

The training schedule under thirst motivation 
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was similar to that existing during the initial 
stage of the study. The Ss received three trials 
daily. When making the correct choice, they 
were permitted to drink for five sec.; following an 
incorrect choice they were kept in the empty 
end-box for 20 sec. The criterion of learning 
was four successive days of correct trials. 


Second Study 


The Ss used in this study were 28 naive female 
albino rats. They were offspring of the female 
rats of the Sprague-Dawley strain mated with 
male rats from the University of Colorado’s psy- 
chology department’s strain. At the beginning 
of the study their ages ranged from 120 to 130 
days. 

First problem.—The experimental procedure 
was similar to that used in the first study except 
for the motivational condition of the animals 
and the reward used. One end-box of the maze 
(in this study painted a flat black) contained a 
glass coaster filled with water while the other 
end-box remained empty. Group T was 
motivated by 22 hours of water deprivation 
while satiated for food, while Group T&H had 
a drive state based upon 22 hours of deprivation 
of both food and water. The regimen was as 


follows: 15 min. after the completion of their 
daily trials the animals of both groups were given 


their rations of Purina Dog Chow checkers and 
kibbled meal. Large water receptacles were 
also inserted into the home cages at this time. 
One hour later the water receptacles and the 
remaining food were removed from the cages. 
On the following day, one hour prior to the train- 
ing trials, a pan of dry kibbled meal was inserted 
into the cages housing the animals in Group T. 
No commodity was presented to Group H&T at 
this time. This regimen was repeated on suc- 
cessive days. 

In the training series, during which time the 
animals were given three free choice trials daily, 
the Ss were permitted to drink for five sec. 
following a correct response and were detained 
in the empty goal box for 20 sec. following an 
incorrect choice. 

Second problem.—On the day following the 
achievement of the criterion of learning the ani- 
mals were motivated by food deprivation while 
satiated for water. This change in drive was 
manipulated by permitting the animals to con- 
sume water and food for one hour, beginning ap- 
proximately 15 min. after the last criterion trial. 
After this hour period all the remaining food was 
removed while the water receptacle remained. 
One hour prior to the next day’s trial a pan of 
water was inserted in all cages to induce further 
drinking. 


Resutts AND Discussion 


The results of the first problem of 
both studies suggest that the effect 
of a relatively strong irrelevant drive, 
whether it be hunger or thirst, is to 
retard animals in reaching a learning 
criterion in a simple T-maze situation. 

Table I indicates the mean number 
of errors' and reinforcements for 
Group H and H&T to reach the cri- 
terion of learning to go to food. Since 
the distributions of the errors were 
highly skewed, the Festinger test of 
significance (1) of. differences between 
means was used. The results were 
significant at about the five percent 
level. The difference between rein- 
forcements did not approach statistical 
significance. 

Table II reveals the retarding 
effect of an irrelevant hunger drive. 
In fact, the retarding influence of the 
hunger drive was so great, that seven 
of the Ss in Group T&H failed to 
reach the criterion of four successive 
days of perfect runs after 55 days of 
training. It had been decided when 
it became evident that several Ss in 
Group T&H were not going to achieve 
the criterion that training would be 
terminated after 55 days. The data 
of those Ss in Group T&H who 
reached the criterion are treated 
separately from those who did not. 
The former is referred to as T&H (C) 
while the latter is identified by T&H 
(No C). A comparison of Group T 
with either of the T&H sub-groups 
indicated that the differences in errors 
and reinforcements were significant 
beyond the one percent level. 

Although the results suggest an 
irrelevant hunger drive in a thirst- 
water problem has a greater retarding 
effect than an irrelevant thirst drive 
in a hunger-food situation, it should 


! These results and their implications were 
presented in a previous paper (4). 
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be recognized that the two studies 
reported here involved two different 
strains of albino rats. Such a con- 
clusion would obviously have to be 
based on investigations involving ani- 
mals with an identical hereditary 
background. 

Additional evidence which indicates 
that an additional drive can retard 
learning is contained in a paper by 
Muenzinger and Fletcher (10). 
These investigators trained two groups 
of rats in a visual discrimination box. 
One group was motivated by escape 
from shock while the other group was 
motivated by hunger and escape from 
shock. The results revealed that the 
group motivated by both hunger (ir- 
relevant drive) and shock-escape made 
morg errors in reaching the criterion 
of ‘garning than did the group moti- 
vated by shock-escape. 

The data directly relevant to the 
theoretical issue tested are contained 
in Tables III and IV. According to 
the non-selective principle of associ- 
ation of drive stimuli the groups 
possessing an irrelevant drive during 
the first problem should be retarded 
in learning the second problem. The 
irrelevant drive of the first problem, 
which becomes the relevant drive of 
the second study, should have a tend- 
ency to evoke the correct response of 
the first problem. Since this response 
would be the incorrect response dur- 
ing the second problem, learning 
should be retarded. According to the 
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TABLE I 


Mean Numper or Errors ann Retnrorce- 
ments TO Carrerion or LEARNING 
Dvurinc Fiast Prostem (To 
Water) or Seconp Stupy 








Errors | Reinforcements 


Group 





Mean o Mean o 





7.00} 2.56 
24.28 | 10.40 


16.00} 11.44 
42.57) 17.61 
17.85 


| 
T&H (C) 
T&H (No C)* 








56.86 17-88 | 108.14 





* The data for Group T&H (No C) refer to 
their performance during their 55 days of train- 
ing in which time they did not achieve the 
criterion of learning. 


selective principle no difference be- 
tween the two groups would be 
expected. 

The results tend to be in agreement 
with the implications of the selective 
principle and at variance with the 
expectations of the non-selective prin- 
ciple. Table III reveals no evidence 
of retardation on the part of Group 
H&T in mastering the second prob- 
lem. With the use of the Festinger 
technique (1), neither of the differ- 
ences (errors or reinforcement) be- 
tween Groups H&T and T approached 
statistical significance. The data pre- 
sented in Table IV tend also to be in 
agreement with the selective principle. 
No significant differences in mean 
number of reinforcements or errors 
were obtained between Groups T and 
T&H(C) or between Groups T and 
T&H(No C)? 

One might attempt to reconcile 
these findings with the non-selective 
principle by attributing the lack of 


The mean number of errors and reinforce- 
ments in Group T was raised appreciably by one 
animal. All other members of Group T had 
reached the criterion of learning in the second 
problem by the 38th day. This animal required 
86 days. By excluding the record of this animal 
the mean number of reinforcements became 36.23 
while the mean number of errors was changed to 
14.77. 
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any significant difference between the 
groups to the Hullian mechanism of 
afferent neural interaction. By ac- 
cepting such a position one would 
assume in the first study that during 
the first problem the irrelevant thirst 
drive stimulus did acquire some habit 
strength to evoke the response to food. 
However, this habit strength was 
acquired by the thirst drive stimulus 
when it was present in conjunction 
with the hunger drive stimulus. This 
attempted explanation would rest 
upon the assumption that little or no 
habit strength was generalized from a 
compound involving both a 22-hour 
thirst drive stimulus and a 22-hour 
hunger drive stimulus to a 22-hour 
thirst stimulus alone. This explana- 
tion is quite different from the one 
involving a_ selective principle. 
Whereas the selective principle as- 
sumes no habit strength is acquired 
by the irrelevant drive stimulus, the 
‘afferent neural interaction assump- 
tion’ assumes that some habit strength 
is acquired, but none is generalized. 
Unfortunately we have no generaliza- 
tion curves of a compound involving a 
drive stimulus which remains con- 
stant while other components vary. 
However, from our present knowledge 
of stimulus generalization, it would 
appear very unlikely that none or 
very little habit strength would be 
generalized in our experimental situ- 
ation if the non-selective principle 
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TABLE IV 
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were true. Some confirmation of 
this viewpoint is contained in the re- 
sults of an experiment by Webb (11). 
He trained two groups of animals to 
depress a bar in a Skinner box situ- 
ation. One group was motivated by 
hunger and thirst, the other by only 
hunger. During the extinction series 
both groups were motivated by hunger 
and thirst. No difference was found 
between the extinction rates of the two 
groups, i.e. the change from a hunger 
drive stimulus to hunger and thirst 
drive stimuli did not produce any loss 
of habit strength. 

Webb’s results, of course, would be 
consistent with the selective principle. 
Such a principle would assume that 
the irrelevant drive would not be- 
come associated (not acquire any 
habit strength) and consequently no 
difference between the extinction rates 
of the two groups would be expected. 
Kendler (4,7) has reported other 
evidence which tends to support the 
selective principle. All of the above 
mentioned studies, it should be rec- 
ognized, have involved only hunger 
and thirst drives. In this area, at 
least, the selective principle appears 
more valid than the non-selective 
principle. 

SuMMARY 


The purpose of these studies was to 
test the selective principle of associ- 
ation which asserts that only those 
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drive stimuli which are themselves 
reduced become connected to a re- 
warded response. 

In the first study Group H (hun- 
gry) and Group H&T (hungry and 
thirsty) were trained in a single unit 
T-maze to choose the side leading to 
food (Problem 1). Following the at- 
tainment of the criterion of learning in 
Problem 1, the Ss in each group were 
trained to reverse their position pref- 
erence (Problem 2). This was ac- 
complished by making the animals 
thirsty and placing water in the previ- 
ously unbaited goal box. The second 
study was a duplicate of the first study 
except that during Problem 1 the 
animals were motivated either by 
thirst (Group T) or thirst and hunger 
(Group T&H) and the baited goal box 
contained water. In Problem 2 the 
prevailing motivation was hunger; 
food was placed in the previously 
empty goal box. 

No significant differences were ob- 
tained between the two groups, in 
each study, in reaching the criterion 
of learning in Problem 2. 

If all drive stimuli became con- 
nected to a correct response in accord- 
ance with the non-selective principle 
of association, then the groups poss- 
essing the ‘irrelevant’ drive (Groups 
H&T and T&H) during Problem 1 
should be retarded in learning Prob- 
lem 2. This would follow because 
the cue from the prevailing drive 
state in Problem 2 (thirst in the first 
study, hunger in the second) would 
tend to evoke the correct response of 
Problem 1, which was the incorrect 
response for Problem 2. Since no 
retardation was evidenced, the data 
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are interpreted to be consistent with 
the selective principle of association 
of drive stimuli. 


(Manuscript received June 7, 1949) 
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RESISTANCE TO EXTINCTION AS A FUNCTION OF 
THE DISTRIBUTION OF EXTINCTION TRIALS! 


BY VIRGINIA FAIRFAX SHEFFIELD 
Yale University 


INTRODUCTION 


Most previous experimenters re- 
porting on the relation between rate 
of extinction and the distribution of 
extinction trials have concluded that 
extinction is faster when trials are 
massed than when they are spaced 
(1, 4,9, 13). This is in line with the 
implications of spontaneous recovery, 
which suggest faster extinction with 
massed than with, spaced extinction 
trials, since more recovery from the 
effects of nonreinforcement can occur 
between trials in the latter case. 
However, this result has not always 
been found. One experimenter (10, 
11) concluded that there is no rela- 
tionship between rate of extinction 
and interval between extinction trials. 
Another experimenter (12) concluded 
that massed extinction is faster fol- 
lowing spaced training, but that there 
is no consistent difference between 
massed and spaced extinction follow- 
ing massed training. 

The results of the present study are 
of special interest because they are at 
variance with previous conclusions on 
the subject. Other experimental re- 
ports on this topic will later be dis- 
cussed in relation to the present re- 
sults; and a hypothesis will be 


1 This article reports an incidental but impor- 
tant finding in a dissertation submitted to the 
faculty of the Department of Psychology of Yale 
University in partial fulfillment of the require- 
ments for the Ph.D. degree. The writer is in- 
debted to Dr. Neal E. Miller, under whose di- 
rection the research was conducted, and to Dr. 
Clark L. Hull and Dr. Carl I. Hovland, who 
served on the advisory committee. The major 
results of the dissertation have been reported 
previously (14). 


presented which may explain the 
present findings. 


MetTuop 


Seventy-two male albino rats about three 
months old were trained to run down an alley 
with food at the end of the alley as reinforcement. 
The strength of this response was measured by 
the time elapsing from the opening of the start- 
ing-box door until the animal entered the goal 
box. Half of the animals were given 30 training 
trials massed (15-sec. interval), and half were 
given 30 training trials spaced (15-min. interval). 
Each of these two training groups was divided 
for extinction, half receiving 30 massed extinction 
trials and half receiving 30 spaced extinction 
trials. Extinction was started 24 hours after 
the start of training, and the time intervals used 
for massed and spaced extinction trials were 15 
sec. and 15 min. respectively, exactly as in 
training. A feature of the design which is 
incidental to the present findings is that half of 
the animals were trained with reinforcement on 
every trial and half with reinforcement randomly 
on 50 percent of the trials. The effects of this 
factor have already been reported (14) and need 
not be considered here, since the two types of 
reinforcement are equally represented in all 
groups. The design of the experiment is shown 
in Table I. 


Details of apparatus and procedure 


Only those details of the procedure used which 
are considered especially relevant to this report 
are presented here. For a more complete de- 
scription of apparatus and procedure, the reader 
is referred to the previous article (14). 

The apparatus consisted of a four-foot alley 


connecting a starting box and a goal box. Dur- 
ing training small pellets of wet mash were pre- 
sented in a food cup in one corner of the goal 
box; during extinction the food cup was re- 
moved. A curtain prevented the animal from 
seeing whether the food cup was present or 
absent until he had entered the goal box. Two 
time measures were recorded automatically: 
starting time (time from the opening of the door 
of the starting box until the animal left it) and 
running time (time from leaving the starting box 
to entering the goal box); but, as will be noted 
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TABLE I 


Desicn or Tue Exreriment Suowinc 
Number or Cases * 








Extinction 





Massed 


I 9 
Training 


| 
| i 








Spaced 
Training 








9 
9 
9 
9 


9 
I 9 
| ul 9 








* I and II refer to 100-percent and 50-percent 
reinforcement respectively. (See text.) 


later, the two measures were combined in the 
statistical analysis. 

On the first day 10 massed pre-training trials 
were given in the apparatus; on the second day 
the 30 regular training trials were given; and on 
the third day the 30 extinction trials were given. 

The experimental session was started each 
day at the time the animals were accustomed to 
being fed. For two weeks prior to the experi- 
ment they had been fed a restricted diet of wet 
mash (seven gm. dry weight, each day). Since 
the training session extended over a much longer 
time for the spaced group than for the massed 
group, it was not possible, with the experimental 
setup used, to obtain strictly comparable hunger 
drive from group to group. However, the 
restricted diet maintained the hunger drive at 
such a high level at all times for all rats that 
it was felt that any variability in absolute level 
would have little effect on the results. Rats 
given massed training trials had not eaten for 
23 hours at the start of extinction. But since 
spaced training was spread through a period of 
about 7} hours, at the end of which time the 
animals were given the balance of their ration for 
that day, rats given spaced training trials had 
not eaten for 16} hours at the start of extinction. 
In an attempt to make the hunger drives some- 
what more comparable for the two groups, the 
space-trained animals were given a total food 
ration of only six gm. on the day of training 
instead of the usual seven gm. 

During extinction if either the starting time 
or the running time exceeded two min., the ani- 
mal was removed from the apparatus; but all 
animals were given the full 30 extinction trials. 
The animals spent the intervals between trials 
in their individual living cages. For spaced 
trials this interval was 15 min. as measured from 
the start of one trial to the start of the next. 
For massed trials this interval was 15 sec. from 
the end of one trial to the start of the next. In 
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the latter case, therefore, the interval between 
starts of two successive trials was somewhat vari- 
able due to variable running time, but it rarely 
went over two min. and was considerably less 
than two min. during training and early extinc- 
tion. 


RESULTS 


When extinction curves were 
plotted separately for the two time 
measures, starting time and running 
time, exactly the same pattern of 
results was obtained. Therefore, the 
two measures were combined into one 
measure of ‘response time’ in order to 
use a more stable measure and simplify 
the statistical analysis. 

Fig. I compares extinction curves 
for the two groups: those with massed 
extinction trials and those with spaced 
extinction trials. Mean response 
times could not be calculated on the 
extinction data because of the re- 
moval of some animals from the 
apparatus for exceeding the two- 
minute time limit. Median response 
times were also indeterminate, not 
only for plotting extinction curves 
(because on some trials more than half 
the animals exceeded the time limit), 
but also as a measure of individual 
animals’ performances for testing the 
significances of differences (because 
some animals exceeded the time limit 
on more than half their extinction 
trials). Therefore, a frequency meas- 
ure was employed, the number (or 
percentage) of trials on which per- 
formance equalled or fell below the 
combined median response time for 
the particular groups being compared. 
The overall median response time for 
the 72 animals on all 30 extinction 
trials was 16.3 sec., and Fig. 1 shows 
the percentage of responses which 
were equal to or below this value, 
averaged for each set of five trials. 
The corresponding percentage values 
for the last five training trials are 
shown as reference points. 





DISTRIBUTION OF EXTINCTION TRIALS 307 


$ $ $ é 


8 


% RESPONSES AT OR BELOW MEDIAN 








Fic. 1. Comparison of massed and spaced 
extinction, showing percentage of responses 
which equalled or were below 16.3 sec. (the over- 
all median response time). The reference points 
at the beginning are the values for the last five 
training trials. 


The initial points on the curves 
show that performance at the end of 
training was almost identical for the 
two groups. For the first 10 extinc- 
tion trials the curves fall off at fairly 
comparable rates. But thereafter 
there is a wide divergence, the spaced 
extinction curve dropping at a much 
faster rate than the massed extinction 
curve. 

Table II presents statistical tests 
of the significance of this difference. 
Because the divergence started after 


the first ten extinction trials, the sig- 
nificance test was made separately 
for all extinction trials and for the 
last half of the trials. The means in 
Table II for all 30 trials used the 
median cut-off point of 16.3 sec., 
obtained for all extinction trials of all 
animals combined; the means for the 
last 15 trials used the median of these 
15 trials, 25.3 sec., as the cut-off 
point. The statistical analysis treats 
the data as nine replications of the 
experiment, with eight degrees of 
freedom. This procedure was 
adopted because the method used for 
assigning rats to the eight original 
groups made the eight rats within the 
same replication more homogeneous 
than the nine rats within a given ex- 
perimental group. Therefore, in test- 
ing for the significances of the mean 
differences between the two major 
conditions (i.e., massed versus spaced 
extinction), the appropriate differ- 
ence was found among rats within each 
replication, and the mean difference 
was tested from the distribution 
of nine differences so obtained. This 
procedure removed from the estimated 
variance of mean differences any cor- 
relation due to the balancing proce- 
dure used to assign rats to groups. 
Since any difference anticipated from 
the results of earlier investigators 


TABLE II 


Mean Number or Responses Meetinc Criterion,* Massep Extinction 


ComPparReD witH Spacep 








Mean Score 





Massed Ext. | Spaced Ext. 


Diff. 





All 30 ext. trials 17.1 I 
(Md. = x sec.) 
Last 15 ext. trials 


97 
(Md. = 25.3 sec.) 








2.9 4-2 2.00 2.1 07 


5-3 4-4 0.89 4-9 <.01 




















* Le., mean number of responses equal to or below the median response time for the trials being 
considered. 


** For both tails of distribution. 
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would be either zero or in the opposite 
direction from the results obtained, 
the probabilities given in the table 
are for differences of the obtained 
size in either direction. 

Table II shows that for all extiac- 
tion trials, a difference between 
massed and spaced extinction of the 
obtained size and in either direction 
approaches significance, and for the 
last half of the trials, where the diver- 
gence is apparent, the difference is 
highly significant. 

The question immediately arises as 
to whether this difference between 
massed and spaced extinction holds 
both for groups which were trained 
with massed trials and for groups 
which were trained with spaced trials. 
Fig. 2 and Table III present the data 
with this breakdown. In order to 
permit comparisons of the curves, the 
median cut-off point used in all curves 
was the overall median of 16.3 sec., 


but for maximum sensitivity in the 
significance tests, the median cut-off 
points used were in each case for the 


particular groups and number of 


AFTER MASSED TRAINING 
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trials being compared. As before, 
the probabilities in the table are for 
both tails of the distribution. 

Fig. 2 shows that the faster extinc- 
tion with spaced extinction trials is 
obtained both following massed train- 
ing trials and following spaced training 
trials, but that the difference is more 
marked after massed training. And 
the significance tests in Table III 
show that for groups trained with 
massed trials, spaced extinction is 
significantly faster both for all ex- 
tinction trials and for the last half of 
the trials; but for groups trained with 
spaced trials, the differences, while in 
the same direction, are not significant. 

The generality of the difference in 
rate of extinction with massed and 
spaced trials was further tested by 
analyzing separately the results for 
groups trained with 100-percent rein- 
forcement and those for groups trained 
with so-percent reinforcement. The 
pattern was very similar for the two 
reinforcement groups and confirmed 
the finding that spaced extinction 
proceeds at a faster rate than massed 
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Comparison of massed and spaced extinction following massed training and following 


spaced training, showing percentage of responses which equalled or were below 16.3 sec. (the overall 


median response time). 
trials. 


The reference points at the beginning are the values for the last five training 





DISTRIBUTION OF EXTINCTION TRIALS 


TABLE Ill 


Mean Number or Responses Meetinc Crrrerton,* Massep Extinction Comparep 
with Spacep, Broxen Down sy Distaisution or Trarninc TRIALs 








Mean Score 





Massed Ext. | Spaced Ext. 





After massed tng. 
All 30 ext. trials 17.3 I 
(Md. = 14.2 sec.) 
Last 15 ext. trials 10.0 
(Md. = 24.4 sec.) 

After spaced tng. 
All 30 ext. trials 16.9 I 
(Md. = 19.1 sec.) 
Last 15 ext. trials 9.2 
(Md. = 27.0 sec.) 








2.8 45 
49 5-1 


3.1 3.8 
5.6 3.6 : 1.6 




















*L.e., mean number of responses equal to or below the median response time for the particular 


groups and number of trials being considered. 
** For both tails of distribution. 


extinction. For all 30 extinction 
trials the probabilities that a differ- 
ence in either direction as large as 
that obtained would arise by chance 
are .17 for 100-percent reinforcement 
and .17 for so-percent reinforcement. 
For the last half of the extinction 
trials the probabilities are .03 and .o2 
respectively. As in the overall analy- 
sis, the differences are significant for 
that part of the extinction curve where 
the divergence is apparent. 


Discussion 


The results obtained in this experi- 
ment with massed and spaced extinc- 
tion are surprising in view of previous 
research and conclusions on the sub- 
ject. However, although previous 
workers have almost uniformly favored 
the conclusion that spacing of extinc- 
tion trials produces greater resistance 
to extinction than massing, they have 
not as uniformly obtained clear-cut 
evidence that this is true. In some 
cases, the data do not appear to 
warrant the conclusion. 


Pavlov (9, pp. 62-53) states, “The 
shorter the pause [between successive 
repetitions of the stimulus without rein- 


forcement] the more quickly will extinc- 
tion of the reflex be obtained, and in 
most cases a smaller number of repeti- 
tions will be required.” He presents 
data, purporting to substantiate the 
statement, on five successive series of 
extinctions of a conditioned salivary 
response in one dog. With intervals of 
2, 4, 8, 16, and 2 min, extinction was ob- 
tained in 1§ min., 20 min., $4 min., over 
2 hours, and 18 min., respectively. 
Speed of extinction, for him, refers to the 
length of the experimental session. As 
Guthrie (2) has pointed out, however, 
the number of nonreinforced presenta- 
tions of the conditioned stimulus (a more 
usual method of measuring speed of 
extinction) is about the same in each 
case, namely, 6, 5, 7, 8 (incomplete), and 
7, respectively, suggesting no relation 
between speed of extinction and interval 
between trials. Furthermore, if extinc- 
tion curves of amount of conditioned 
saliva on successive trials are plotted 
from the data Pavlov presents, the 
curves for the different time intervals 
fall at roughly the same rate. In any 
case, conclusions based on one subject 
should be regarded as highly tentative. 

Another bit of evidence which must 
be considered merely suggestive because 
it involves only one subject is found in an 
experiment by Hilgard and Marquis (4). 
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In a study of the conditioned eyelid 
response in dogs, they incidentally 
tested one animal under two extinction 
conditions. When 60  nonreinforced 
trials were presented 20-40 seconds 
apart, extinction developed rapidly; but 
when the animal was given 10 extinction 
trials per day (at the same intervals) 
every other day for 20 days, day to day 
averages showed little decrement. 

Gagné (1) performed two experiments 
with rats on an elevated runway. In 
the first experiment, using intervals of 
4, 1, 2, 3, $, and 10 min. (the same for 
acquisition and extinction), he found the 
rate of extinction fastest for the two 
shortest intervals, the others being in 
reverse order with slowest extinction for 
the two-min. interval. The second ex- 
periment, though also measuring rate of 
extinction as a function of inter-trial 
interval, involves a second variable and 
cannot be interpreted simply in terms of 
the relationship of inter-trial interval 
and resistance to extinction. 

There are several problems in connec- 
tion with Gagné’s first experiment. One 
is that the various groups, having the 
same intervals in acquisition as in ex- 
tinction, had reached different levels of 
performance when acquisition was termi- 
nated. The rates of extinction for the 
different groups were perfectly correlated 
with their levels of performance at the 
end of training, all extinction curves 
merely converging by the fourth or 
fifth extinction trial. 

Another problem is that, although 
Gagné extinguished to a criterion of 
three min. latency, the only extinction 
data presented are latency curves for the 
first five extinction trials. Since the 
present study revealed a difference only 
after 10 extinction trials, it would appear 
that five extinction trials are not enough 
to form the basis for a conclusion about 
the relative rates of extinction with 
different intervals. 

Rohrer (13), using rats with a modified 
Skinner-box technique that permitted 
discrete trials, tested with extinction 
intervals of 10 and go sec. He used two 
additional variables, number of training 
trials and number of hours of food dep- 


SHEFFIELD 


rivation at the start of extinction. 
Some groups extinguished faster with 
spaced trials, but the groups most like 
those in the present study as regards 
amount of training and level of drive 
extinguished faster with massed trials. 
The explanation for this discrepancy 
may lie, among other possibilities, in the 
response used (operating a bar versus 
running in an alley) or in the time inter- 
val used for spaced trials (90 sec. versus 
1§ min.). 

Porter has performed two studies, one 
(10) on rats running a straight alley to 
food and one (11) on the human eyelid 
response, and concluded that there is no 
relation between rate of extinction and 
interval between extinction trials. 
There might be some question, however, 
as to whether the conclusion is war- 
ranted. In the study on rats, where 
intervals of 0, 5, and 10 min. were used, 
the collection of data on only five ex- 
tinction trials leaves in doubt, as in 
Gagné’s study (1), whether a difference 
might not have developed in later extinc- 
tion trials, the limit of extinction not 
having been reached. 

Even with only five extinction trials, 
however, the study contains suggestive 
evidence supporting the present results. 
Data from incomplete runs, when the 
animals were removed from the apparatus 
for exceeding the two-min. time limit 
used, were not included in making the 
curves of mean running time over the 
five extinction trials. The effect of this 
would be to reduce the mean running 
times by not including the animals hav- 
ing the largest running times. Since 
such incomplete runs occurred most often 
in the group with the longest inter-trial 
interval, the curve for the most widely 
spaced group would be depressed more 
than the curves for the groups with 
shorter intervals. It may be that if 
medians or frequencies had been used 
and all data included, spaced extinction 
would have been faster as in the present 
study. As Porter himself points out in 
a footnote, the relationship between dis- 
tribution of trials and number of incom- 
plete runs, in itself, suggests faster 
extinction with spaced trials. 
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Another measure used, average num- 
ber of sections of the alley entered by 
each group on the five extinction trials, 
also provides data supporting the present 
results. Here data on animals which 
were removed are included in the aver- 
ages; and the curves show a correlation, 
after the second extinction trial, between 
rate of extinction and interval between 
extinction trials, spaced extinction being 
fastest, but no significance tests were 
made. 

Porter’s study (11) on the human 
eyelid response, like his study on rats 
(10), contains some suggestive evidence 
in support of the present results, though 
the conclusion reached was that there is 
no relation between rate of extinction 
and spacing of extinction trials. Porter’s 
subjects were extinguished to a criterion 
of no response on two successive trials, 
but a maximum of 1¢ trials was given. 
His intervals of 10, 20, 40, 80, and 180 
sec. gave as mean number of trials to 
extinction 5.43, 7-25, §.25, $5.14, and 
5.00, respectively, but these means do not 
include subjects who failed to reach the 


extinction criterion within the 15 trials 


presented. If medians are calculated on 
Porter’s data so that all subjects may be 
included, the results are 8, 10, 6, 6.5, and 
§.§ trials to extinction, respectively, 
suggesting faster extinction for the longer 
extinction intervals. Porter presents 
Vincent curves of latency, amplitude, 
and duration of the conditioned response 
during extinction, the data being plotted 
separately for subjects who reached the 
extinction criterion during the 1¢ trials 
and for subjects who did not. The only 
set of curves that shows any consistent 
trend is the latency curve for subjects 
who extinguished to the criterion, and 
in this case the curves for the five groups 
are arranged in perfect order from slowest 
extinction for the 10-sec. group to fastest 
extinction for the 180-sec. group. 
Reynolds (12) conditioned the human 
eyelid response under massed conditions 
(presumably 10-20 sec. between trials) 
and under spaced conditions (1-2 min. 
between trials). Each training group 
was divided for extinction, half being 
extinguished with an interval of 10-20 
sec. and half with an interval of 1-2 
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min. When training had been massed, 
no consistent differences were found 
between massed and spaced extinction; 
but when training had been spaced, 
massed extinction was quite consistently 
faster than spaced extinction. 


Thus it may be seen that the 
evidence in the literature is rather 
tenuous for the commonly-accepted 
conclusion that extinction is faster if 
extinction trials are massed than if 
they are spaced. It would appear 
that further experimental work needs 
to be done to clarify the problem. 

On the surface at least, the present 
results do not support temporary 
inhibition or fatigue theories of ex- 
tinction such as those of Pavlov (9), 
Miller and Dollard (7), and Hull (6). 
Thus, Pavlov states that experimental 
extinction is the result of the building 
up through nonreinforcement of an 
inhibitory state which dissipates with 
the passage of time. According to 
this theory, inhibition summates, as 
repetition of the unreinforced condi- 
tioned stimulus continues, until the 
conditioned response finally disap- 
pears completely. The shorter the 
intervals between extinction trials, 
the less the dissipation of inhibition 
and the faster the required intensity 
of the inhibitory state is reached, i.e., 
the faster will be the rate of experi- 
mental extinction. Miller and Dol- 
lard utilize a fatigue factor as a major 
source of extinction. Continued per- 
formance of a response results in 
muscle tension and fatigue, and cessa- 
tion of the response is rewarded by 
escape from this fatigue. Thus, the 
fatigue drive and the consequent re- 
ward from stopping the response will 
produce extinction unless other drives 
and rewards are dominant. Again, 
the shorter the intervals between 
extinction trials, the faster will fatigue 
build up and the faster will the re- 
sponse tend to drop out. Hull pro- 
poses an interpretation similar to that 
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of Miller and Dollard, but assumes 
more generally a state (‘reactive in- 
hibition’) which results from perform- 
ance of the response, which inhibits 
the response, and which dissipates 
with passage of time. All of these 
theories imply more rapid extinction 
with massed than with spaced extinc- 
tion trials, the opposite of the results 
obtained here. It should be noted, 
however, that Miller and Dollard do 
not propose fatigue as the only 
factor—the running response used 
may not have been sufficiently tiring 
to permit the prediction that massing 
would produce faster extinction. 

A theory of extinction which has 
been proposed by Guthrie (3) and by 
Wendt (15) is that extinction is a 
relearning process, the old response 
dropping out as new and incompatible 
responses are learned. It has been 
claimed (5, 12,13) that this theory 
implies that spaced extinction trials 
should lead to faster extinction since 
they would offer to the learning of the 
new response whatever advantages 
there are in spacing of training trials. 
The previous results indicating that 
massed extinction is faster have been 
cited as evidence against the relearn- 
ing theory of extinctioa. If this 
theory does imply that spaced extinc- 
tion will be faster, then the results of 
the present study support the theory. 
There might be some doubt, however, 
as to whether the implication is cor- 
rect. The theory does not state that 
the extinction of the old response re- 
sults from the learning of some single 
new response incompatible with the 
old response, but that any incompati- 
ble response which happens to occur 
when the cues for the old response are 
present will weaken the tendency for 
the old response to occur subsequently. 
There may be a large number of such 
incompatible responses from trial to 
trial during an extinction session. 

One hypothesis that may explain 
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the results is that frustration genera- 
ted by the omission of reinforcement 
during extinction has a motivational 
effect, which shows up as an increased 
vigor in performing whatever response 
the subject has a tendency to perform. 
With massed extinction trials, the in- 
creased motivation produced by ab- 
sence of reinforcement on one trial 
would still be present when the ani- 
mal was put into the apparatus for his 
next trial and would maintain the 
vigor of the running response, causing 
the animal to continue to go down to 
the end of the alley trial after trial 
and thus slowing extinction. With 
spaced extinction trials, on the other 
hand, the frustration and its accomp- 
anying increase in motivation would 
have dissipated, in greater or less 
degree depending on the interval 
used, by the start of the following 
trial, resulting in a relatively weaker 
response and fasterextinction. Other 
experimenters have noted the in- 
creased agitation produced by the 
nonreinforced trials during extinction 
of a habit based on hunger (e.g., 
Miller and Stevenson, 8). That the 
animals became excited during extinc- 
tion in the present experiment was 
easily apparent from casual observa- 
tion. The animals frequently bit at 
the experimenter’s hand, dashed 
wildly down the alley in spurts and 
starts, etc. But no systematic re- 
cords were made that would permit a 
comparison of massed and spaced 
extinction animals. 

It is possible that this postulated 
increase in motivation would produce 
slower extinction with massed trials 
in some situations and faster extinc- 
tion with massed trials in other situ- 
ations. If the increased motivation 
were channelized into the response 
being measured, as proposed above 
for the running response used in this 
experiment, it would cause massed 
extinction to be slower than spaced 
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extinction. But if, in other learning 
situations, the increased motivation 
were channelized into responses in- 
compatible with the response being 
measured, it would cause massed ex- 
tinction to be faster than spaced ex- 
tinction. For example, in an alley 
there is little for an active animal to 
do except go to the other end, but 
with a bar-pressing response increased 
activity might show up in the form of 
running around inside the box rather 
than in pressing the bar. This differ- 
ence in whether the conditioned re- 
sponse or an incompatible response is 
the one made more vigorous by the 
frustration from  nonreinforcement 
may account for inconsistencies in 
results on the relative speeds of ex- 
tinction with massed and spaced 
extinction trials. 

This hypothesis could not be applied 
in learning situations (4, 11, 12) utiliz- 
ing a noxious reinforcing stimulus, 
such as air puff to the eye or shock, the 
omission of which would not be 
frustrating. 

SUMMARY 


1. Seventy-two rats were trained 
to run down an alley for food. Half 
were trained with a 15-sec. interval 
between trials and half with a 15-min. 
interval. Each training group was 
divided for extinction, half being ex- 
tinguished with the 15-sec. interval 
and half with the 15-min. interval. 
The measure of strength of response 
was time to traverse the alley. 

2. Extinction proceeded at a sig- 
nificantly faster rate when extinction 
trials were spaced than when they were 
massed. 

3. For the group trained with 
massed trials, spaced extinction was 
very significantly faster; but for the 
gioup trained with spaced trials, the 
difference, while in the same direction, 
was not significant. 

4. A hypothesis is proposed to 
explain these results in terms of 
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increased motivation resulting from 
the frustration produced by nonrein- 
forcement. 


(Manuscript received May 24, 1949) 
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TRANSFER OF DISCRIMINATION TRAINING 
TO A MOTOR TASK! 


BY ROBERT M. GAGNE,? KATHERINE E. BAKER 
Connecticut College 
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HARRIET FOSTER 
University of Michigan 


The learning of one class of motor 
skills involves the establishment of 
different responses to each of a given 
set of stimuli. The present study 
deals with the learning of this kind of 
a skill, namely, one in which a subject 
must learn to make a number of differ- 
ent manual responses to the same 
number of visual stimuli. In previ- 
ous studies (1,2) such learning has 
been conceived in terms of the estab- 
lishment of discriminative habits by 
differential reinforcement of the re- 
sponses to be learned. The present 


study was designed to investigate the 
extent to which preliminary training 
on individual discriminations facili- 
tates the learning of the total set of 
discriminations present in the whole 
task. 


A previous experiment (1) demon- 
strated that a high degree of transfer of 
training results from practice on the 
components of a motor skill. The skill 
studied involved four different manual 
responses, each of which was to be asso- 
ciated with one of the four stimulus 
lights on a stimulus panel. In this task 
the individual was required to learn to 
make responses to switches 1 and 2 on 
the left of the panel on the appearance of 
a red and a green light (respectively) on 


' This article is a slightly modified version of 
Report 316-1-6 under Contract N7onr-316, Task 
Order I, between the Special Devices Center, 
Office of Naval Research, and Connecticut 
College. 

* Now at Perceptual and Motor Skills Re- 
search Laboratory, ATRC Human Resources 
Research Center, Lackland Air Force Base, 
Texas. 


the bottom of a stimulus panel, and to 
make responses to switches 1 and 2 on 
the right to a red and a green light 
(respectively) at the top of the stimulus 
panel. To make a correct response, a 
color discrimination was required be- 
tween the lights of each pair in a given 
position, and a position discrimination 
between lights of a given color in differ- 
ent locations. In the learning of a cor- 
rect response to a given stimulus of the 
total task, both kinds of discrimination 
were involved. 

Two types of initial training were 
given in this original experiment. One 
group practiced the responses required 
for lights differing only in position, while 
another group practiced responses to 
lights which differed only in color. 
Thus, one kind of initial training involved 
the learning of a position discrimination 
while the other kind involved the learn- 
ing of color discrimination. The results 
of the experiment indicated considerable 
positive transfer from 30 trials of pre- 
liminary training on either the position 
or the color discrimination. The evi- 
dence showed that 30 trials of this type 
of preliminary training markedly reduced 
the number of errors below those made by 
a control group. Furthermore, it was 
shown that the greater proportion of this 
error reduction resulted from the elimina- 


.tion of errors to the specific stimuli 


employed in the preliminary training. 
The question left unanswered by the 
results of this first study concerns the 
degree of specificity of the transfer which 
was found. It would be of great interest 
to know the extent to which the learning 
of the discriminations involved in the 
total task is facilitated by the prelimi- 
nary training on both color discrimination 
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and position discrimination. Further- 
more, it would be interesting to discover 
to what extent such transfer of discrimi- 
nation, if it is found, is independent of the 
actual stimuli and responses which are 
common to both the preliminary and 
final tasks. These questions were not 
answered by the original experiment be- 
cause no means were provided for deter- 
mining the type of error made to each 
presented stimulus. 


The purpose of the study reported 
here was to discover the extent to 
which the transfer from specific dis- 
crimination training to the total task 
is confined to the actual stimuli and 
responses common to both tasks, and 
the extent to which it is a more gen- 
eral effect involving the stimuli and 
responses not used in preliminary 
training. In other words, the present 


experiment is concerned with deter- 
mining whether the transfer of dis- 
crimination training really takes place 
independently of transfer of specific 


S-R associations. The experiment 
involved giving 30 trials of position 
discrimination training to one group 
of subjects and the same amount of 
color discrimination training to an- 
other group, followed in each case by 
the learning of a total task which in- 
volves both kinds of discrimination. 
It is reasoned that if the transfer of 
discrimination training is a reality, po- 
sition-discrimination training should 
facilitate subsequent learning of this 
kind of discrimination in the total 
task, not only when the stimuli and 
responses employed in the preliminary 
training are presented, but also when 
the stimuli not so used are presented. 
Likewise, color-discrimination train- 
ing should facilitate the learning of 
color-discrimination in the total task, 
not only in connection with the spe- 
cific stimuli and responses used in 
training, but also to the stimuli and 
responses not so used. In each case 
the degree of facilitation that the pre- 
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liminary training provides should be 
measurable as a reduction in errors of 
each specific type. 

The results of the previous study 
were interpreted in terms of a hy- 
pothesis which makes use of the con- 
cept of generalization. It was rea- 
soned that the preliminary training 
given on part of the task had the 
effect of reducing the generalization 
tendencies of each stimulus used. 
Thus, preliminary training on one 
kind of discrimination should bring 
about a reduction in this particular 
type of error to all of the stimuli of the 
final task, provided that these stimuli 
bear a close enough resemblance to 
the ones used in preliminary training 
so that they can be presumed to fall 
within the range of the generalization 
gradient. A second implication of 
this hypothesis may also be pointed 
out. Since the subject is required in 
the present task to discriminate each 
stimulus light on the basis of two 
different dimensions (i.e. color and 
position), it may be expected that, at 
the beginning of learning, different 
degrees of generalized response tend- 
ency will exist for each of the two dif- 
ferent dimensions. In other words, 
one might expect the subject to ex- 
hibit greater tendencies for error (i.e., 
generalized response tendencies) along 
one dimension than he shows along 
the other. Consequently, the effec- 
tiveness of the preliminary training in 
reducing generalization tendencies 
may depend upon the degree to which 
these tendencies are present before 
training begins. Reducing the gen- 
eralization tendencies along the di- 
mension in which discrimination is 
the more difficult should have the 
greater effect on the subsequent 
learning of the total task, and con- 
sequently bring about the greater 
amount of transfer. The present 
experiment provides evidence for these 
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additional implications of the hy- 
pothesis. 


Metuop 


A pparatus.—The motor task in this study was 
identical with the one employed in a previous 
investigation of transfer of practice on compo- 
nents on a motor skill (1). In general, the task 
required the subject to hit one of four switches on 
a horizontal panel in response to one of four 
lights appearing on a vertical stimulus panel. 
The stimulus lights were arranged in two pairs, 
separated by 7}$ in. in the vertical dimension. 
One light of each pair was green, the other red. 
The response switches were also arranged in 
pairs, one pair on the left, the other pair on the 
right of the mid-line of the response panel. Of 
each pair the left-hand switch was numbered 1 
and the right-hand switch was numbered 2. 
The subject’s task was to press the appropriate 
switch as rapidly as possible as soon as a given 
light appeared. A correct response turned out 
the light and stopped the electric timer used to 
measure time of reaction. The apparatus was 
so constructed that the top lights were turned 
out with the right-hand pair of switches, while 
the bottom lights were turned out with the left- 
hand pair of switches. In each case, switch 
number 1 was always associated with a red light 
and switch number 2 with a green light. The 
subject always started his response from a cen- 
tral starting button which had to be depressed 
for any light to be turned on. The apparatus 
permitted the measurement of time of reaction 
as well as the recording of the number and loca- 
tion of incorrect switches pressed by the subject. 

Subjects. —A total of 140 Navy enlisted men 
served as subjects in the experiment. 28 of these 
men formed the control group (Group C) and 
practiced the total task during 60 presentations 
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of the stimulus lights with no preceding initial 
training. A second group (Group CD) received 
30 trials of initial practice on reactions involving 
color discrimination only. Since color dis- 
criminations are required within each pair of 
stimulus lights, this group was divided into two 
sub-groups of 28 men each, and one sub-group 
practiced the color discrimination involving the 
top pair of lights while the other sub-group 
practiced the same discrimination involving the 
bottom pair of lights. A third group containing 
56 subjects (Group PD) was given 30 trials of 
training on position discrimination prior to 
learning the total motor task. This group was 
likewise divided into two groups of 28 subjects 
each. One of these sub-groups received training 
in position discrimination using the top and 
bottom red lights, while the other sub-group 
practiced position discrimination using the top 
and bottom green lights. 

In order that differences among the results 
from the experimental and control groups could 
reasonably be attributed to differences in condi- 
tions of training, all groups of subjects were 
matched on the basis of two criteria previously 
shown to correlate with the task used in the 
experiment. One criterion was a learning score 
obtained from the administration of the full-page 
Woodworth-Wells substitution test with a time 
limit of 90 sec. The second criterion was a 
measure of reaction time obtained by averaging 
times for 25 reactions to a single switch in re- 
sponse to the bottom green light of the apparatus. 
The multiple correlations of these two criteria 
with the average response time on the total task 
has been found to be .424, and with total errors, 
.350. Table I presents the means and standard 
deviations of scores on the two matching criteria 
for each sub-group. The differences in means 


TABLE I 


Scores on Marcuinc Crirerta ror Groups anp Sus-Groups 
Usep tn THE Experiment 





Groups 





CD (1) 
N =28 


PD (1) 
N = 28 


PD (2) CD (1&2) 
N =28 N =56 


PD (1&2) 
N =56 





Substitution Test— 
number right 
Mean 


SD 


Reaction Time 0.01 
second 
Mean 
SD 











63.32 
10.4 


63.46 
10.8 


63.40 
10.5 


63.41 


40.89 41.89 
5-4 46 7.3 


42.89 
8.6 
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and in variability among groups are not statisti- 
ally significant. 

Experimental procedure —Each of the experi- 
mental groups (Groups CS and PD) was given 
30 trials of initial training on one of the two 
types of discrimination followed by 60 trials of 
practice on the total motor task. Half of the 
subjects in Group CD practiced color discrimina- 
tion on the bottom pair of red and green lights, 
half with the top pair. Half of the subjects in 
Group PD practiced the position discrimination 
with the top and bottom green lights, half with 
the red lights. During this initial training, the 
two switches which were not used in each case 
were covered with rounded wooden covers which 
did not interfere with the required reactions. 
Group C, the control group, proceeded immedi- 
ately to learning the total motor task involving 
four switches and four lights, for 60 trials. In 
all cases the lights were presented in a predeter- 
mined random order. Four different random 
orders were used on equal numbers of subjects 
within each of the sub-groups. 

It should be made perfectly clear that the 
terms ‘color discrimination’ and ‘position dis- 
crimination,’ as used in this paper, are not in- 
tended to have a generality beyond that of 
referring to the specific stimuli and responses 
used in the present experimental situation. 
Thus, as color discrimination training continued, 
we can say that the subject showed improvement 
in the connections Srea—Rewiteh 1 aNd Sareea 
Rewitch 2, in the sense that the time of both re- 
sponses and the number of incorrect responses 
were reduced. In the case of position discrimin- 
ation training, an improvement in performance 
is taken to indicate a progressive establishment 
of the conncetions Spottom—Rewiteh 1 and Stop— 
Rewiteh n- The learning relationships found do 
not, then, reflect differences between discrimina- 
tions of color and of position in general, but only 
between the acquired discriminations ‘which 
occur in the learning situation employed here. 
These discriminations are identified in each case 
by means of the cue which is most obvious in 
terms of experimental procedure, but which is 
in all probability not the only one involved. 

The instructions given to the subjects in the 
experimental groups were identical with those 
employed in the previous study (1) except for 
appropriate changes demanded by the specific 
types of discrimination training employed. The 
instructions to the contro! group which began to 
learn the motor task without this initial training 
were also identical with those used previously. 
Subjects were told the correct response for each 
light and were instructed to respond as quickly 
as possible whenever a light appeared on the 
stimulus panel. 

The learning score used for matching pur- 
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poses was obtained from each subject at the 
beginning of each experimental session by an 
administration of the digit-symbol substitution 
test with a time limit of go sec. The score 
employed was number of correct substitutions, 
The measure of reaction time used in matching 
was obtained at the end of the experimental ses- 
sion. The subject was instructed to react as 
quickly as possible to the lower green light by 
pressing switch number 2 on the left (the re- 
sponse he had already learned to make to this 
stimulus). Actually, the measure used was that 
of ‘discrimination’ reaction time, since the sub- 
ject was instructed not to respond to the upper 
green light whenever it appeared. This proce- 
dure was used to prevent a possible tendency to 
respond to an auditory cue provided by the 
operator’s switch. The score employed was the 
average time of reaction to 25 presentations of 
the lower green light. 


RESULTs 


Transfer in terms of response time 


The results obtained in this experi- 
ment, in terms of the measure of mean 
response time in successive 10-trial 
stages of learning the final task, are 
similar in every important respect to 
those obtained in a previous experi- 
ment (1). Thirty trials of prelimi- 
nary training on either color dis- 
crimination or position discrimination 
result in considerable positive transfer 
to the final motor task involving both 
types of discrimination. Both the 
experimental groups have signifi- 
cantly shorter response times than 
those of the control group throughout 
the sixty trials, although the reduction 
is particularly pronounced in the 
initial stages of learning. These re- 
sults confirm those which were ob- 
tained in the previous study. 

It is of some importance to deter- 
mine to what extent the reduction in 
time scores on the final task is de- 
pendent upon the fact that the pre- 
liminary discrimination training em- 
ployed one of the two pairs of lights 
and switches used in the final task. 
Preliminary training may reduce re- 
sponse time to the practiced lights 











318 


R. M. GAGNE, K. E. BAKER, AND H. FOSTER 


TABLE I 


Time or Response per Triat (iw 1/100 Sec.) ron Stmuut Usep anv ror Stimuti Nor 
Usep 1x Pretowiwary Tramiunc, Comparep witx Controt Group VaLves In 
Successive 20-Triat Staces or Leaaninc tue Fivat Tasx 











41-60 





Group C 


Group CD 
rained lights 


Untrained lights 


Group PD 
Frained lights 


Untrained lights 








85.33 
47-1 


83.61 


79.09 
SD 478 


49.6 


82.83 
45-5 


77.01 
41-4 


79-54 
38.5 


91.59 
SD 57-3 


80.86 
45-4 


75-05 
34.6 


82.99 
42.7 


82.24 
42.3 


Mean 93-11 
SD 53-7 
Mean 91.39 
SD 53.8 














only, or such training may have a 
more general effect and also serve to 
reduce time scores for the other pair 
of stimulus lights. An answer to this 
question may be obtained by breaking 
down the data for each experimental 
group into two categories: time of 
response to lights used in training, and 
time of response to lights not used in 
training. The results of such an 
analysis of the data appear in Table 
II. Time scores are given for suc- 
cessive 20-trial stages of learning. 
This presentation of the data shows 
that the average time taken to respond 
correctly to the lights used in training 
is not very different from the average 
time of responses to the untrained 
lights. (An exact statement of the 
significance of the differences would 
require knowing the correlation be- 
tween response time to the trained 
and untrained lights. However, in 
view of the large standard deviations 
obtained, it seems fairly certain that 
these differences are not significant.) 
Thus, it would appear that prelimi- 
nary training which involves practicing 


either differential responses to color 
or differential responses to position is 
in fact discrimination training, which 
facilitates the learning of other re- 
sponses to stimuli which have not 
entered into the training. The trans- 
fer effects of such discrimination train- 
ing may not be accounted for on the 
basis of having increased the speed of 
making specific responses to the 
specific stimuli used in training. 


Transfer in terms of error scores 


Amount of transfer —Another meas- 
ure which is significant in analyzing 
the transfer effects of discrimination 
training is that of number of errors. 
Table III presents, for each group, 
the average number of errors made 
during each 10-trial stage of learning 
of the total motor task. These data 
represent the average number of 
initial errors’? made, without refer- 

* In performing the final motor task, a subject 
may on one occasion press more than one incor- 
rect switch before finally making a correct 
response. Throughout the present paper only 


the first of these incorrect responses is counted 
as an error. 
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ence to whether the error represented 
a confusion of color or a confusion of 
position. The transfer effects of pre- 
liminary discrimination training are 
revealed in the differences between 
error scores for the control group and 
for the two experimental groups 
trained on color discrimination and on 
position discrimination. As the table 
indicates, both the experimental 
groups begin the learning of the final 
task with markedly reduced error 
scores. The CR’s of these differences 
show that color discrimination has 
significantly reduced the errors made 
during the first 20 trials. On the 
other hand, position discrimination 
training results in considerably less 
positive transfer, which appears sig- 
nificantly during trials 11-20 only. 
The greater effectiveness of the color 
discrimination training in reducing 
errors is revealed by significant differ- 
ences (at least at the five percent 


level) between the two experimental 
groups on the first 20 trials of learning. 
In the case of both trained groups, 
discrimination training brings about 
a reduction in errors for trials 11-20, 
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the point at which a marked increase 
typically occurs in the control group. 

The final column of Table III gives 
the average number of errors made 
during the entire learning by each 
group. Color discrimination training 
reduces the total errors to a significant 
degree, while position discrimination 
training does not. 

Type of error reduced by discrimina- 
tion training.—The results presented 
so far establish the fact that both 
color-discrimination training and posi- 
tion-discrimination training exhibit 
positive transfer to the learning of the 
total task, when all types of errors are 
included. It is of particular interest, 
also, to inquire whether or not errors 
resulting from particular types of 
confusion are affected differently by 
the two kinds of training. In the 
final motor task, a subject may make 
three different types of overt errors: 
(1) He may press a switch which is 
correct in the sense that it is associ- 
ated with the proper pair of lights in 
either the top or bottom position, but 
incorrect in the sense that it is associ- 
ated with the wrong color within the 


TABLE Ill 


Averace Number or Errors 1x Successive 10-Triat Staces or LEARNING 
tHE Tota Sxitt, Toceruer wirn CR’s or Dirrerences Between 
VALUES FoR THE Various Groups 








Group 








Group C 


Group CD 


Group PD 











Critical Ratios: 
Group C-Group CD 
Group C-Group PD 
Group PD-Group CD 
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TABLE IV 


Averace Numser or Coror, Posrrion, ann Comsrnation Cotor axp Posrrion Errors 
Mane sy Eacn Group ww Successive 10-Triat Staces or Learninc THe Frnar Tasx 





Type of Error 








Color Errors 
Group C 
(N= 28) 


Group CD 
(N = 56) 


Group PD 
(N = 56) 





Position Errors 


Group C 
(WV = 28) 


Group CD 
(N= 56) 


Group PD 
(N= 56) 





Combination Errors 
Group C 0.61 
(N= 28) 0.8 


Group CD 0.25 
(N= 56) 0.6 


Group PD 0.25 
(N= 56) 0.5 


























pair. This kind of error will be desig- 
nated a Color Error. (2) He may 
press a switch which is correct as to 
color but incorrect so far as the posi- 
tion of the stimulus is concerned. 
This is called a Position Error. (3) 
He may press a switch which is incor- 
rect in terms of both the color and the 
position of the stimulus. This type 
is called a Combination Error. It 
might be expected that color-dis- 
crimination training would tend to 
reduce Color Errors to a greater ex- 
tent than it does Position Errors. 
Likewise, training in position dis- 
crimination might be expected to be 
particularly effective in reducing Po- 
sition Errors. 


The data for this analysis are given 
in Table IV, which presents the error 
scores of the two experimental groups 
and the control group for successive 
10-trial stages of final learning. Er- 
rors have been divided into the three 
types: Color, Position, and Combina- 
tion. The CR’s for the differences in 
these values between groups and be- 
tween error types are presented in 
Table V. 

Considering first the data from the 
control group only, it is evident from 
Table III that without discrimination 
training more errors resulting from 
color confusions occur than do errors 
indicating confusion of position. 
While Color Errors are considerably 
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reduced during the 60 trials of learn- 
ing, Position Errors are very little 
affected. The differences between the 
mean number of these two types of 
errors for Group C are significant 
over the first 30 trials of learning, as 
shown in Table V. (The correlation 
between total Color and Position 
Errors is 0.16. Four random orders 
of presentation of stimuli afforded an 
equal number of opportunities to 
make each type of error within each 
ro trials for the group as a whole.) 
It may be seen, then, that the results 
in terms of errors point to the con- 
clusion that the color discrimination 
is the more difficult of the two dis- 
criminations provided by the total 
motor task. Furthermore, color dis- 
crimination is affected more by con- 
tinued practice than is the less 


difficult position discrimination. 
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Turning now to a consideration of 
the effect of color-discrimination train- 
ing upon the reduction of specific 
types of errors, the results can be seen 
in the contrast of values for Group C 
and Group CD. A considerable re- 
duction in Color Errors has evidently 
been brought about during the first 
30 trials of final learning as a result of 
preliminary color discrimination train- 
ing. In terms of Color Errors, the 
control group required approximately 
40 trials to reach the level of perform- 
ance maintained by Group CD, which 
had been given 30 preliminary trials 
of practice on color discrimination. 
The difference between groups is sig- 
nificant at the five percent level or 
better over the first 30 trials of final 
learning. The reduction in Position 
Errors which results from color dis- 
crimination training is an additional 


TABLE V 


CriticaL Ratios ror THe Dirrerences Berween Mean Vatvues Saown 
tn Tasie IV, ror Groups, anp ror Type or Error 
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Color Errors 
Group C-Group CD 
Group C-Group PD 
Group PD-Group CD 





Position Errors 
Group C-Group CD 
Group C-Group PD 
Group PD-Group CD 





Combination Errors 
Group C-Group CD 
Group C-Group PD 
Group PD-Group CD 





Group C 


C Errors-P Errors 





Group CD 
C Errors-P Errors 





Group PD 
C Errors-P Errors 
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fact of interest shown by these data. 
Although by no means as striking as 
the reduction in Color Errors (only 
two of the differences between Group 
CD and Group C are statistically 
significant), the trend is quite con- 
sistent. It might not be expected 
that color discrimination training 
would have any effect upon Position 
Errors in the total motor task. It 
seems reasonable to conclude from 
these data, however, that color-dis- 
crimination training has had a more 
general effect in reducing errors, in 
addition to its specific effect of reduc- 
ing errors of color confusion. It may 
be noted in support of this conclusion 
that Group CD also made significantly 
fewer Combination Errors than Group 
C throughout the first 20 trials of 
final learning. 

The data for Group PD show the 
effect position-discrimination training 
has on the reduction in Color and 
Position Errors during the learning of 
the total motor task. First of all, it 
may be noted that the preliminary 
training has accomplished some reduc- 
tion in Position Errors. Actually, 
the amount of this reduction is about 
the same as that produced by color- 
discrimination training. The differ- 
ence in errors between Groups C and 
PD is significant for trials 11-20 and 
31-40. On the other hand, the pre- 
liminary training in position discrim- 
ination has produced no reduction in 
Color Errors. The group which has 
had training in position discrimination 
exhibits essentially the same number 
of Color Errors during the final learn- 
ing as does the control group. A 
greater number of these errors occur 
during the final stages of the learning 
of Group PD than is the case in Group 
C. It may also be seen from Table 
IV that the preliminary training in 
position discrimination (Group PD) 
has brought about a reduction in 
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Combination Errors by an amount 
about equal to that effected by color- 
discrimination training (Group CD). 
Color and position errors to trained 
and untrained lights.—The results so 
far presented serve to establish the 
fact that color discrimination brings 
about a reduction in Color Errors and 
a somewhat less marked reduction in 
Position Errors, whereas position- 
discrimination training brings about 
some reduction in Position Errors but 
none at all in Color Errors. Analo- 
gous to the question posed for response 
time scores, it is of interest now to 
know if any part of the reduction in 
the different types of errors brought 
about by training may be attributed 
to the reduction in errors to the pair 
of lights not used in the training. It 
is possible that the trained individual 
does not make errors in responding to 
the particular set of lights used in 
training, but when he is presented 
with an unpracticed light, he makes as 
many errors as the untrained indi- 
vidual. On the other hand, since 
response time is reduced for untrained 
lights as well as for trained lights, dis- 
crimination training may have a gen- 
eralized effect on errors too, and thus 
reduce errors to unpracticed lights. 
The data providing an answer to 
this question are given in Table VI, 
which presents the error score for each 
type of error (Color, Position, Com- 
bination) made to those stimuli used 
in preliminary training, and to those 
stimuli not used in preliminary train- 
ing. Error scores are given for suc- 
cessive 20-trial stages of final learning. 
Since half of the total number of lights 
was used in the training of each group, 
the error scores on trained and un- 
trained lights for the experimental 
groups are compared with half the 
mean number of errors made by the 
control group. Fig. 1 presents graphi- 
cally the relationships revealed by 
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TABLE VI 


Averace Number or Coror, Posrrion, anp Compination Errors Mane sy Eacu Group 
to Stimuti Usep anv to Stisvi: Not Usep mx Pretmunary Tratninc in Successive 
Staces or Learninc Tue Finat Tasx, Tocetuer wire Criticar 
Ratios oF THe Dirrerences Between Tuese VaLues 











Trials 1-20 


Trials 21-40 Trials 41-60 





Cotor | Pos. | Comb. 
Errors | Errors | Errors 





Color 
Errors 





Pos. | Comb. | Color | Pos 
Errors | Errors | Errors | Errors 





0.61 
0.71 


1.45 | 0.98 


1.48 





0.64 
0.95 


0.14 
0.40 


0.29 
0.56 


0.20 
0.44 


0.18 





0.82 
0.68 


1.41 


0.55 





Critical Ratios: 
Group C- 
T Group CD 


Group C- 
UT Group cD 








Cc 





1.04 











0.46 


| 2.17 
1.41 


| 1.86 


2.93 








0.13 











these data. Errors on trained and 
untrained stimuli made by each group 
may be compared for Color Errors 
(left graph), Position Errors (middle 
graph), and Combination Errors (right 
graph). 

As the data of Table IV have al- 
ready indicated, the most significant 
effects may be expected to reveal 
themselves during the first 20 trials of 
final learning. Devoting our attea- 
tion to these trials in each case, we 
can find evidence that the effects of 
preliminary training have not been 
confined to the specific influences of 
practice on particular stimuli. In the 
left hand graph it may be noted that 


Group CD, which was trained on 
color discrimination, shows, for trials 
1-20, significantly fewer Color Errors 
than Group C on the lights not used 
in preliminary training as well as on 
trained lights. The greater reduction 
in Color Errors in this group occurs in 
the case of the lights used in training, 
but the tendency for this training to 
generalize to the untrained lights is 
nevertheless a significant one. Simi- 
lar conclusions can be drawn with 
regard to the effect of preliminary 
color-discrimination training on the 
reduction of Position Errors. The 
training evidently brought about not 
only a reduction of this type of error 
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Fic. 1. 


Color errors, position errors, and combination errors made by each group on successive 


20-trial stages of final task learning, to those light stimuli used in preliminary training, and to those 
not used in preliminary training. For each type of error, comparison is made with half the mean 
values for the control group, in order to make the number of presented stimuli equivalent. 


to lights used in preliminary training, 
but also a significant, though smaller, 
reduction in errors to the untrained 
lights. 

In the case of Group PD which re- 
ceived position discrimination train- 
ing, there is a significant reduction in 
Position Errors (center graph) to the 
lights which were not used in prelimi- 
nary training. In this instance, re- 
markably enough, it is found that 
Position Errors to the lights previ- 
ously used in training are not sig- 
nificantly reduced. However, as pre- 
viously pointed out, Color Errors are 
not reduced for either trained or un- 
trained lights. 

Errors resulting from confusion of 
both color and position of lights 
(Combination Errors) are less fre- 
quent than either of the other two 
types, but are nevertheless reduced 
by both types of discrimination train- 
ing. These results may be seen in the 


right hand graph of Fig. 1. The dif- 
ferences in errors between the Control 
Group and both experimental groups 
are significant for trials 1-20, in the 
case of both trained and untrained 
stimuli. Thus, so far as Combination 
Errors are concerned, both types of 
preliminary training may be said to 
have a generalized effect as well as the 
specific effect of reducing errors to 
the stimuli on which preliminary 
training has been given. 


Discussion 


The results which have been re- 
ported indicate that training on either 
color discrimination or position dis- 
crimination transfers to a considerable 
degree to the learning of a motor task 
involving both these types of dis- 
crimination. When transfer is meas- 
ured in terms of a reduction in response 
time, the two types of training are 
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apparently equally effective. On the 
other hand, when transfer is meas- 
ured as a reduction in errors, the color 
discrimination training turns out to 
be more effective for transfer than 
does the position discrimination train- 
ing. 

In seeking the reason for the dif- 
ferential effectiveness of these two 
types of initial training, we first 
notice the fact that learning the posi- 
tion discrimination in the present 
situation is a less difficult task than is 
learning the color discrimination. 
That is to say, subjects exhibited 
fewer confusions between right and 
left switches after establishing the 
habits Spottom—Reswiteh 1 and Stop— 
Rewikh en than they did between 
switches 1 and 2 after establishing 
the habits Srea—Rewiteh 1 and Sareen 
—Regwiterh 2 This is shown by com- 
paring the number of Position Errors 
with the number of Color Errors 


made by the control group during 60 
trials of learning the total task (cf. 


Table IV). When the stimuli differ 
from each other in both color and 
position, the discrimination is still 
easier to learn, as the small number of 
Combination Errors indicates. In 
other words, when faced with the 
necessity of reacting to a given stimu- 
lus of the motor task, the subject 
sometimes makes the response ap- 
propriate to the other color, less fre- 
quently the response appropriate to 
the other position, and least frequently 
of all, the response appropriate to 
both the other color and the other 
position. 

Holding this fact in reserve for a 
moment, let us consider another 
aspect of the present findings. What 
is the effect of specific kinds of dis- 
crimination training on the type of 
errors made in the final task? In the 
first place, if the transfer of discrimi- 
nation training is not confined to the 
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exact stimuli used in such training 
(and Fig. 1 shows that it is not), one 
might expect that color-discrimina- 
tion training would reduce the Color 
Errors made in the final task, that 
position-discrimination training would 
reduce the Position Errors, and that 
both kinds of training would reduce 
Combination Errors. Each of these 
things is established by our data (cf. 
Table IV). Recalling now the differ- 
ential difficulty of the two types of 
discrimination, one might go on to 
suppose that the reason for the greater 
effectiveness of color-discrimination 
training in reducing errors is simply 
that there are more Color Errors to be 
reduced in the first place. 

But the data of Fig. 1 show that 
this interpretation is not adequate. 
Not only does color-discrimination 
training reduce Color Errors in the 
final task, but Position Errors as well. 
In contrast, training on position dis- 
crimination has no effect on Color 
Errors, and brings about a reduction 
in Position Errors no greater than 
that effected by color-discrimination 
training. In other words, the data 
do not support the idea that the kind 
of error reduction in the final task is 
specific to the kind of discrimination 
training provided in the initial task. 
Rather than this, they indicate a 
relationship between the difficulty of 
the trained discrimination, regardless 
of the type of cue involved, and the 
amount of error reduction. Thus, 
the data support the following general- 
ization: preliminary training on a 
given discrimination reduces the final- 
task errors associated with discrim- 
inations equally or less difficult, but 
does not reduce those errors associ- 
ated with a more difficult discrimina- 
tion. In other words, color discrim- 
ination training has reduced Color 
Errors (associated with equally diffi- 
cult discriminations) and also Position 
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and Combination Errors (associated 
with less difficult discriminations). 
Position-discrimination training has 
reduced Position Errors (associated 
with equally difficult discriminations), 
and Combination Errors (associated 
with less difficult discriminations), but 
has not reduced Color Errors (associ- 
ated with a more difficult discrimina- 
tion.) 

The general effect of discrimination 
training on error reduction is pictured 
in Fig. 2. In this figure, the three 
types of errors are plotted along the 
horizontal axis in order of decreasing 
difficulty of the discrimination with 
which each type is associated. (The 
relative size of these units of difficulty 
is, of course, not known.) On the 
vertical axis, the number of errors of 
each type is expressed as a percent of 
the total number of errors made by 
the control group. The latter value 


represents a kind of ‘total possible’ 


error score to which the amount of 


(RELATIVE STRENGTH OF RESPONSE TENDENCY) 


PERCENT OF TOTAL CONTROL GROUP ERRORS 





i i i 


c Ld come 
TYPE OF ERROR 
(OMFICULTY OF DISCRIMINATION) 





Fic. 2. The percent which final task errors 
of each type are of the total errors made by the 
control group. Group C, control group; Group 
CD, group given preliminary color discrimina- 
tion training; Group PD, group given prelimi- 
nary position discrimination training. As a 
representation of generalization gradients, the 
graph indicates the relative amount of reduction 
in strength of generalized responses between 
stimuli of three degrees of similarity, brought 
about by two different types of training. 
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each type of error reduction can be 
related. Thus, the graph shows that 
of the ‘total possible’ errors made by 
the control group, $2 percent were 
Color Errors, 33 percent Position 
Errors, and 15 percent Combination 
Errors. For the group trained in po- 
sition discrimination the percent of 
this same ‘total possible’ value that 
each type of error represents is as 
follows: Color Errors, 58 percent; 
Position Errors, 24 percent; Combina- 
tion Errors, g percent. For the 
group trained in color discrimination 
the corresponding percents are: Color 
Errors, 42 percent; Position Errors, 
24 percent; Combination Errors, 8 
percent. Thus, for the group given 
training on color discrimination, er- 
rors have been reduced from Color 
Errors out to the end of the scale of 
decreasing difficulty. For the group 
trained in position discrimination, 
errors have been reduced only from 
the point of Position Errors on. 

The transfer effects of training on 
the components of a total motor skill 
have previously been accounted for 
by means of a hypothesis which makes 
use of the concept of generalization 
(1). The present results provide 
confirmation for certain further im- 
plications of this hypothesis. In par- 
ticular, they are consistent with the 
idea that the effect of training is to 
reduce the amount of generalization 
associated with the discrimination on 
which training has been given, and 
not simply to increase the strength of 
a connection between a specific stimu- 
lus and a specific response. Without 
attempting here to explore completely 
the theoretical implications of this 
finding, it may be pointed out that the 
graphs of Fig. 2 may be considered to 
represent the general form of gradi- 
ents of stimulus generalization for each 
of the three groups. The percent of 
each type of error provides a measure 
of the generalized response tendency 
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to stimuli differing from the original 
stimulus in color, in position, and in 
both color and position, respectively. 
(The strength of the ‘correct’ re- 
sponse tendency, the 100 percent point 
on the ordinate of the graph, may be 
tentatively assumed to be given by 
the total error score of the control 
group, though no special significance 
is attached to this assumption.) 
The horizontal axis may then be con- 
ceived to represent a scale of decreas- 
ing similarity, with identity at the 
origin. Thus, a stimulus differing 
from the original (i.e., the actually 
presented) stimulus in color is fairly 
similar, a stimulus differing in position 
is less similar, and a stimulus differing 
in both color and position is least 
similar. The graph may then be read 


in this way, considering first the values 
for the control group: when an indi- 
vidual learns a response to a given 
stimulus, he acquires also a tendency 


to make the same response to a stimu- 
lus which differs in color; a somewhat 
weaker response tendency to a stimu- 
lus which differs in position; and a 
still weaker response tendency to a 
stimulus which differs in both color 
and position. 

Maintaining this view of these re- 
sults, what can be seen to be the 
effect of specific discrimination train- 
ing on the generalization gradient? 
The effect of continued practice on a 
discrimination learning task is to 
reduce the generalization tendencies, 
as some of Pavlov’s (3) results indi- 
cate. The present results go a little 
farther than this, in that they suggest 
that discrimination training reduces 
the generalization tendencies not only 
to the specific cues on which training 
has been given, but also to cues 
which are easier to learn than these 
specific cues. Another way of stating 
this result is as follows: specific dis- 
crimination training makes the stimuli 
of a task easier to discriminate in a 
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subsequent learning situation, not 
simply on the basis of the practiced 
aspects of stimulation, but in the 
sense of reducing the over-all difficulty 
of discrimination learning. 

It must be admitted that the pres- 
ent study was not designed to yield 
specific evidence on the effect of 
practice on gradients of generaliza- 
tion. Consequently these results 
merely suggest what this relationship 
might be. They do, however, serve 
to emphasize the inadequacies of our 
knowledge concerning this important 
concept. When a discrimination is 
established to tones of 700 cycles/sec. 
and 600 cycles/sec., what happens to 
the generalized response tendencies to 
tones between 600 and 700 cycles? 
In the same situation, what happens 
to the generalized response tendencies 
to tones below 600 and above 700 
cycles? The present findings suggest 
that the result of discrimination train- 
ing is to effect a reduction of general- 
ization tendencies which is not con- 
fined to the specific stimuli used in the 
training, but applies to the whole 
range of the generalization gradient. 

The results have implications, too, 
for certain practical problems which 
arise in connection with the training of 
discriminative motor skills. First of 
all, they serve to verify the general 
implication of a previous study (1), 
that training on a part of the task, 
when that part is a component dis- 
crimination, may have a great deal of 
positive transfer value for the learning 
of the total skill. Secondly, and this 
is their specific contribution, they 
emphasize the fact that one kind of 
discrimination training may be con- 
siderably more effective than another 
kind in transferring to the total skill, 
particularly if the number and type 
of errors made is an important con- 
sideration. If one were forced to 
choose which one of two or more dis- 
criminations were to be used for pre- 
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liminary training in a practical situ- 
ation, these results suggest that the 
most difficult discrimination should be 
chosen. Practice on the most difficult 
discrimination, according to our re- 
sults, reduces the errors not only to 
the practiced discrimination, but also 
to other, easier, discriminations. 


(Manuscript received July 5, 1949) 
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AN EMPIRICAL EXPLANATION OF THE SKEWNESS 
OF THE BOWED SERIAL POSITION CURVE 


BY ALAN RIBBACK AND BENTON J. UNDERWOOD 
Northwestern University 


INTRODUCTION 


Few learning phenomena are so 
universal and yet so resistant to ac- 
ceptable theoretical explanation as 
the bowed serial position curve. The 
bowed curve appears in the learning 
of many serial tasks but is most fre- 
quently studied in human learning of 
serial verbal lists. In learning a list 
of serial items it is common observa- 
tion that the initial items in the list 
are acquired first, closely followed by 
the acquisition of those near the end, 
with slowest learning taking place for 
items near the center of the list. 
More precisely, it is found (when re- 
cords for a group of Ss are averaged) 
that the items just past the middle are 
the last to be learned. It is this skew- 
ness in the curve, and not directly the 
bowed curve itself, with which the 
present study is concerned. In order 
to show the approach we have taken 
to the problem, it will be necessary 
first to analyze the nature of serial 
learning. 

The bowed serial position curve 
establishes the fact that in serial 
learning the tendency is to build suc- 
cessively from the extremes of the list, 
toward the middle. With the excep- 
tion of the first and last items in a list, 
each item is bi-functional. The first 
item is a stimulus (S) only; the last 
item is a response (R) only, but each 
item within the list functions both as 
a response to the item before it and as 
a stimulus to the item following it 
(RS). For expository purposes we 
may identify two forms of learning 
which occur in the acquisition of a 
serial list, A-Learning and B-Learning. 
A-Learning occurs during acquisition 


of items at the front of the list, B- 
Learning during the acquisition of 
items at the end of the list. 

In the operation of A-Learning a 
response is given to the first stimulus 
item (S,;-—+R,). That response, in 
turn, becomes a stimulus for the next 
item (S,; — R,S; — R,), and the proc- 
ess continues as successive responses 
assume stimulus function (S,; — R,S, 
=> R,S; > R,S, => Ry er »¥ In the 
operation of B-Learning in, let us say, 
a 12-item list, the last item is given as 
a response to the stimulus item im- 
mediately preceding it (Si2—+ Ris); 
that stimulus in turn becomes a re- 
sponse to the stimulus item which 
precedes it (Si: —» Ri:Si2 — Ris), and 
the process continues as successive 
stimuli assume response function (. . . 
Ss—R Siro RieS 11 Rus Si2 > Ri). 

The above analysis represents an 
abstraction of the empirical observa- 
tion that serial lists are learned ‘from 
both ends toward the middle.’ Un- 
doubtedly, serial learning is not as 
uncomplicated as the above analysis 
suggests, but the bowed serial position 
curve is evidence that the inferred 
processes could exist. Furthermore, 
the skewness of the bowed curve implies 
that A-Learning proceeds at a more 
rapid pace than does B-Learning. 
If the two forms of learning took place 
at equal rates, the last item learned 
should be the middle one; since the 
last item learned is past the middle, 
A-Learning must take place more 
rapidly than B-Learning. If it can be 
shown independently, i.e., out of the 
context of serial learning per se, that 
A-Learning takes place at a more 
rapid rate than B-Learning, the cause 
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of the skewness in the serial position 
curve would appear to be a straight- 
forward inference. The present study 
examines the relative. speed with 


which A-Learning and B-Learning 
take place. 


PROCEDURE 


General procedure.—-The experiment con- 
sisted of two basic conditions only, one empha- 
sizing A-Learning and the. other B-Learning. 
Each condition required § to learn an original 
list (OL) and a derived list (DL). The OL of 
both conditions consisted of six pairs of nonsense 
syllables presented in five different random or- 
ders on successive trials until S anticipated cor- 
rectly all items on a single trial. Thus, OL con- 
sisted of paired-associate learning of six items. 

The DL in both conditions consisted of the 
same pairs of syllables used in OL except that to 
each pair was annexed a third syllable so that 
there were six groups of three syllables each. 
For each group of syllables S was required to 
make two anticipations in a manner to be de- 
scribed shortly. These six groups were pre- 
sented to S in three different orders until all 
anticipations were given correctly on a single 
trial. 

In the DL for Cond. A the annexed syllable 
was placed at the end of each pair and thus func- 
tioned asa response. The paradigm of A-Learn- 
ing was thus simulated; in OL S had learned 
S,—> Ry, and in DL he learned 8, —» RiS:-> Ro. 
In the DL for Cond. B the additional syllable was 
placed at the beginning of each pair, and thus 
functioned as a stimulus. Again, symbolizing a 
pair of items in OL as S$, — Ry, the DL of Cond. 
B became S:—* R2S,-+ Ry. Using a single pair 
of OL syllables, the two conditions may be seen 
operationally as follows: 


OL DL: Cond. A 
QEL (S:) QEL (S;) 
FIP (Ry) FIP (R,S2) 

MYD (R;) 


DL: Cond, B 
MYD (S,) 
QEL (R,S;) 
FIP (R;) 


From the illustration it can be seen that in DL 
for Cond. A S continued to make R; to S;, but 
that in addition R, became a stimulus (S,) for 
Rg, the latter being the annexed syllable. This 
paradigm, therefore, is an abstraction of the 
process required in learning the first part of a 
serial list. In DL of Cond. B, on the other hand, 
S; of OL became Rg, and the annexed syllable 
became S;. The learning of each group of words 
under Cond. B represented the learning required 
in the latter part of a serial list. It will be re- 
membered that there were six pairs of items in 
OL; thus, in DL under each condition, S$ was 
required to give 12 responses, six being those 
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already given in OL and six being new responses. 
To account for the skewness in the bowed serial 
position curve, the learning of the six new re- 
sponses in Cond. A must take place more rapidly 
than the learning of the six new responses in 
Cond. B. 

Specific procedure.—The materials were pre- 
sented on a serial-type memory drum geared to 
the usual two-sec. rate. In OL, each pair of 
items was separated by a hyphen so that S saw 
first the stimulus syllable of a pair for two sec., 
then the response member for two sec., and 
finally, a hyphen for two sec. As the hyphen 
moved out of the window the stimulus syllable 
for a new pair appeared, and so on until all six 
pairs were presented. Ss were required to spell 
the second member of each pair upon the pres- 
entation of the stimulus member. Thus, while 
the pairs did not appear in horizontal fashion as 
is customary in paired-associate learning, the 
learning was distinctly of that nature. As men- 
tioned above, five different orders of presentation 
of OL were used. Each order was separated by 
a two-sec. interval and noted to S by a strip of 
tape across the list. 

In DL the presentation was exactly the same 
as in OL except that three items were presented 
between each hyphen and S had to anticipate 
(by spelling) two items. Approximately one 
min. elapsed between the completion of OL and 
the first presentation of DL, 

The 24 Ss were given two practice days before 
the experimental days. On one of the practice 
days S was given Cond. A, and on the other, 
Cond. B. ‘Twelve Ss had the practice order AB, 
and 12 had the order BA. For the first 12 Ss 
the experimental’ conditions were in order BA, 
and for the other 12 the order was reversed. For 
all four days combined the order was ABBA for 
12 Ss, and BAAB for 12 Ss. On the practice 
days Ss were given detailed instructions in learn- 
ing. No mention was made of the nature of the 
conditions, or of the condition S might expect on 
any particular day. During the experimental 
days the instructions were as follows: 


Prior to OL: “This is a new list. It consists of 
six new pairs of syllables which you are to learn 
in the same manner as the ones you have learned 
before to one perfect trial.” 

Prior to DL: “This next list contains the same 
syllables as before but each group has an addi- 
tional syllable. Watch the list the first trial, 
learning the syllables but not responding to 
them. From then on give as many responses 
as you can until one perfect trial.” 


The lists constructed for use on practice days 
consisted of nonsense syllables with an average 
association value of 80.0 percent as determined 
by Glaze, while those used on the two experi- 





SERIAL POSITION CURVE 


mental days were made up of syllables of 46.7 
and 53.3 percent association value. The lists 
were constructed according to the following 
rules: 


1. In OL no vowel or consonant was used 
more than once among the six stimulus items 
and no vowel or consonant was used more than 
once among the six response items. 

2. In OL there was no repetition of a con- 
sonant or of a vowel for any pair of syllables. 

3. Of the 18 syllables comprising a DL, there 
were no duplications among initial consonants 
and no duplication among final consonants. 

4- In a DL there was no repetition of a con- 
sonant or vowel in the three syllables which 
formed a learning unit, and no repetition of con- 
sonants or vowels among syllables having like 
functions, ¢.g., the annexed syllables. 


Four sets of lists were employed, one for each 
day. These were held constant with conditions 
counterbalanced as noted above. For the prac- 
tice days a Cond. A list and a Cond. B list were 
constructed from each of the two sets of syllables. 
The lists for the experimental days posed a 
special problem. For each set OL would be 
identical whether S was serving under Cond. A 
or under Cond. B. If the annexed syllable for 
DL was the same syllable for Conds. A and B, 
we would have, as desired, varied the function of 
the syllable, but we would have also varied the 
orders for the two conditions. And, while the 
rules for constructing the lists were rigid, it is 
possible that differences in difficulty could be 
produced by the different orders per se. This 
difficulty would be independent of variations 
produced by the differences in the functions of 
the annexed syllable. 

The problem was resolved by counterbalanc- 
ing the syllables of each set. A DL was con- 
structed of six groups of three syllables each. 
The six initial syllables of each group were des- 
ignated a, the six middle syllables 6, and the six 
final syllables c. The three categories were then 
systematically varied and a list was constructed 
for every possible permutation obtained. These 
permutations were as follows: 


DL: 
Cond. A Ss 


OL Ss 

a-b 1& 13 
a-c 3&15 
b-a 5 &17 
be 7& 19 
c-a 9 & 21 
eb 11 & 23 


DL: 
Cond. B 
c-a-b 
b-a-c 
c-b-a 
a-b-c 
b-c-a 
a-c-b 


a-b-c 
a-c-b 
b-a-c 
b-c-a 
c-a-b 
c-b-a 


2& 14 
4&16 
6& 18 
8 & 20 
10 & 22 
12 & 24 


The materials for both the first and second 
experimental days were counterbalanced in the 
above fashion. Thus, in both conditions, all 
syllables appeared in all possible positions and 
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TABLE I 


Measures or Onicrnat-List Leaaninc 








Total Cor- 
rect Re- 
sponses 


One Per- 
fect Trial 





M M 








Cond. A 


29.75 | 2.20 
Cond. B 


33-45 | 3-13 














differences which are found in the learning under 
the conditions cannot be attributed to the lists or 
their order. 


REsuLtTs 


Learning of original list.—Three 
measures of OL learning are shown in 
Table I: (1) the trial on which a cor- 
rect response had been given at least 
once for each item, a criterion which 
we will call early learning; (2) mean 
number of trials to reach a criterion of 
one perfect trial; (3) mean total cor- 
rect responses on all trials. None of 
the differences between Conds. A and 
B for these three measures is signifi- 
cant. The largest t-ratio obtains 
between the conditions for the early 
learning criterion, but is only 1.47. 
With 24 Ss a ¢ of 2.07 is required for 
the five percent confidence level. We 
may conclude that the learning rates 
for OL did not differ appreciably for 
the two conditions. 

Learning of derived list.—The criti- 
cal data of the experiment concern 
the learning of DL. These data are 
shown in Table II. The significance 
of the four different response meas- 
ures in Table II will be pointed out 
separately. 

In terms of the early learning cri- 
terion, the rate of acquisition of DL 
is much faster for Cond. A than for 
Cond. B. The mean difference, 3.75 
trials, gives a t of 3.95, which is sig- 
nificant far beyond the one percent 
level of confidence. In terms of trials 
to reach a criterion of one perfect trial 
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TABLE Il 
Measures or Dearivep-List Leaaninc 











Early Learning 


One Perfect Trial 





M M 


M oM 





8.83 
12.58 


12.88 
15.17 














1.21 
1.33 


11.54 
14-33 








7 
7.83 








the difference between the two condi- 
tions, while in the expected direction, 
does not attain a high level of statisti- 
cai significance. The mean difference 
vields a ¢ of 1.82; this difference might 
occur by chance approximately once 
in 10 times if the true difference were 
zero. However, since the early learn- 
ing criterion showed a highly signifi- 
cant difference, it is probable that the 
difference in trials to reach one perfect 
is not due tochance. The decrease in 
significance may be accounted for in 
part by the increase in variability of 
scores as learning proceeded. This 
increase is evidenced in the error terms 
given in Table II. 


The differences in learning rate for 
the two conditions are shown graphi- 
cally in Fig. 1. The learning curves 
are plotted in terms of mean number 
of trials required to reach the succes- 
sive criteria of 1, 2,3 . . . 12 correct 
responses. It is apparent that the 
differences between the two conditions 
are consistent throughout learning. 

The DL responses may be divided 
into R, and Rg, responses. R, re- 
sponses had previously been learned 
in OL. The position of R; in any 
group of three syllables in DL is in 
the middle for Cond. A (S; —~ R,S: — 
R;), and at the end for Cond. B (S; — 
RS; R,). The Rg, syllables had 
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not previously been learned as re- 
sponses. They occur as a final syl- 
lable in Cond. A and as a middle 
syllable in Cond. B. The six R, re- 
sponses and the six Ry responses com- 
prise the 12 responses of DL. 

Table II gives the mean trial on 
which all six Ri responses were first 
given correctly and the mean trial on 
which the same criterion was attained 
for Ry responses. As would be ex- 
pected, the relearning rate for R;, is 
the same for both conditions, a clear 
indication that the critical difference 
between the two conditions is located 
in the learning of the new responses 
(R:). The learning of R, is appreci- 
ably slower for Cond. B than for Cond. 
A, the difference giving a t of 2.08. 
In each condition the criterion for the 
R; responses is reached approximately 
one trial before the total list is learned. 
Thus, some forgetting or threshold 
oscillation of R, responses is indicated. 

The difference in rates of Rg learn- 


8 
MEAN NUMBER 


Derived-list learning curves for R; and Rs responses 


9 © 
TRIALS 


ing for the two conditions is shown 
most strikingly in Fig. 2. The re- 
learning curves for R, in DL are al- 
most identical for the two conditions 
whereas for the Ry responses the dif- 
ference is clear-cut throughout the 
course of learning. ’ 


Discussion 


The results have shown conclusively 
that B-Learning is slower than A- 
Learning. If A-Learning represents 
the process required to learn the first 
part of a serial list and B-Learning the 
last part, the skewness of the bowed 
serial position curve is explained at 
the empirical level. 

It is noted that in most serial posi- 
tion curves the last item in the list is 
given correctly first a trial or two later 
than the first item. It therefore 
might be argued that the skewness in 
the curve could be*accounted for on 
this basis without invoking different 
rates of A- and B-Learning. That is, 
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if the last item in the list is given later 
than the first item in the early part of 
learning, the subject could build from 
the front of the list toward the middle 
and from the last of the list toward the 
middle at the same rate and still the 
curve would be skewed. Our con- 
tention is that the rates are different, 
and in order to support such an hy- 
pothesis we must show that in the 
actual serial learning situation the 
rates are different. There are no 
completely satisfactory data available 
on this problem since no one has 
systematically rotated position of 
items in a list. Consequently, some 
variation in the data may be intrinsic 
to the items as such and not to their 
serial positions. 

Data brought together by Hull et al. 
(1) on the learning of serial nonsense 
lists provide a fairly clear-cut sub- 
stantiation of our hypothesis. Data 
gathered by Shipley are presented 
(Table 23) which show the mean num- 
ber of failures preceding the first suc- 
cess at various serial positions. Ina 
2o-item list the following mean 
number of failures occurred at serial 
positions 1, 2, 3, 18, 19, 20: 2.2, 3.6, 
4-2, 8.0, 6.0; and 3.2. The difference 
between positions 1 and 2 was 1.4, 
between 2 and 3, 0.6, between 18 and 
19, 2.0, and between 19 and 20, 2.8. 

As an index of slope we have cal- 
culated the differences between the 
mean number of failures for the 1st 
and 2nd positions, for the 2nd and 3rd, 
18th and 19th, and 19th and 2oth. If 
the learning is proceeding at the same 
rate at the first and last parts of the 
list, the difference between the rst and 
2nd serial positions should be approxi- 
mately the same as that between the 
19th and 2oth, and the difference be- 
tween the 2nd and 3rd should be the 
same as between the 18th and 1gth. 
It is obvious that this is not the case. 
The differences at the end of the list 
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are greater than those at the begin- 
ning, thus supporting the idea that 
learning at the end of the list proceeds 
at a slower pace. 

Other data from Shipley given in 
the same source show that the same 
principle holds for a 14-unit list, but 
there is one reversal for an 8-unit list. 
This reversal is removed, however, if 
we average the two differences at 
each end and compare them. Data 
gathered by Hovland also show the 
same trend as those noted in detail 
above (e.g., Tables 2 and 18 in Hull 
et al.). In view of the fact that all 
these data have been gathered on the 
learning of lists in which items were 
not rotated, occasional reversals can- 
not be taken as evidence against our 
hypothesis. It is our conclusion, 
therefore, that not only does it take 
longer for the last item to be acquired 
than it does the first, but the addition 
of items ‘in front of’ the last one is 
slower. 

It must be clear that we have offered 
no hypothesis to account for the differ- 
ent rates of learning in the first and 
last portions of a serial list (A-Learn- 
ing vs. B-Learning), nor have we sug- 
gested any hypothesis to account for 
the bowed serial curve per se. Our 
starting point was the observation 
that the acquisition of items in a serial 
list commonly proceeds from the first 
of the list toward the middle and from 
the last of the list toward the middle. 
We then abstracted these two situ- 
ations from the context of a serial list 
and showed that the learning took 
place at different rates. We submit 
these data to account empirically for 
the asymmetry of the serial position 
curve. 


SUMMARY 


Following the learning of an original 
list of paired nonsense syllables, two 
conditions of learning a derived list 
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were introduced: (1) Cond. A, in 
which a new syllable was appended to 
each original pair and thus became a 
response to the second syllable of the 
pair, and (2) Cond. B, in which a new 
syllable was prefixed to each original 
pair and thus became a stimulus for 
the first syllable in the pair. In terms 
of our analyses, Cond. A represented 
the process required in the acquisition 
of the first part of a serial list as suc- 
cessive responses acquire stimulus 
function. Cond. B, on the other 


hand, represented the process required 
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in learning the last part of a serial list 
as stimuli acquire response function. 
Since the learning under Cond. A was 
more rapid than under Cond. B, we 
have concluded that at the empirical 
level these differences in rate of learn- 
ing will account for the skewness of 
the bowed serial position curve. 


(Manu: pt received June 10, 1949) 
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A REMOTE ASSOCIATION EXPLANATION OF THE 
RELATIVE DIFFICULTY OF LEARNING 
NONSENSE SYLLABLES IN A 
SERIAL LIST 


BY B. R. BUGELSKI?! 
Unwersity of Buffalo 


INTRODUCTION 


In 1935 Hull (6) proposed a deduc- 
tive technique by which theorems con- 
cerning rote learning could be derived 
from a set of postulates and defini- 
tions. As a part of the system, he 
extended the Lepley hypothesis (10) 
to account for the greater difficulty of 
learning syllables in the middle of a 
serial list. His deduction was based 
on the assumption that early syllables 
established trace conditioned response 
associations with syllables later in the 
list. Such trace conditioned re- 


sponses were supposed to be held in 


check by an assumed ‘inhibition of 
delay.” By counting the number of 
such trace associations spanning each 
item in a serial list, the alleged amount 
of inhibition preventing the occurrence 
of each response could be calculated. 
Plotting such values resulted in a sym- 
metrical curve with zero values at 
either end and a peak in the middle.? 
Such a curve does not correspond with 
the factual data gathered in many 
experiments that show that the great- 
est difficulty comes, not at the middle, 
but just beyond the middle of the list 
(5, 17, 18). 

The present writer believes that 
Hull’s system can approximate experi- 


1 The substance of this paper was presented 
at the 1948 meetings of the American Psycholog- 
ical Association in Boston. The writer is in- 
debted to B. Lyndon for assistance in obtaining 
verbatim records of the subjects. 

* This curve has been reproduced by Hilgard 
and Marquis (4), McGeoch (13), and Raskin and 
Cook (16), and so it needs no repetition here. 


mental data more closely if his Postu- 
lates I and II (stimulus trace and 
afferent neural interaction) are more 
completely exploited. The stimulus 
trace postulate, for example, states 
that there is a negative acceleration in 
the diminution of stimulus trace 
strength. In the 1935 deduction 
Hull does not make use of this infor- 
mation, but treats all of the spanning 
associations as equal in strength. 
Likewise, no mention is made of the 
possible interaction among stimulus 
traces in terms of competition for as- 
sociation such as is implied by Hull 
(g) where he speaks of competition of 
reaction potentials at a choice point. 
In a serial list of nonsense syllables 
where each syllable serves as both 
stimulus and response, the opportuni- 
ties for such competition are clearly 
present. By applying these two pos- 
tulates to the problem of learning a 
serial list, the phenomenon of greater 
difficulty just beyond the middle can 
be accounted for in the following 
manner. 

The remote associations that are 
alleged to be responsible for preven- 
tion of correct responses vary not only 
in number, as described by Hull, but 
also in strength. Although the mid- 
dle syllable in a list is spanned by the 
greatest number of remote associa- 
tions, such associations need not be 
more inhibitory of the response than 
some smaller number of stronger re- 
mote associations that may be span- 
ning the next-to-the-middle syllable. 
It remains to show how variations in 
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total remote association strength arise 
at different positions in a serial list. 
The strength of remote associations 
is a function of two factors: (1) the 
point in time at which a response oc- 
curs when a stimulus trace is present, 
and (2) the number of additional re- 
sponses occurring while a trace is 
active. Thus, in a list of eight syl- 
lables, the first stimulus (say an X in 
a memory drum aperture) will be 
active throughout the presentation of 
the entire series if the syllables appear 
at a three-sec. rate. The stimulus 
trace then will have an opportunity 
to be associated with not only the 
first syllable in the list but with every 
other with diminishing strength as 
the successive syllables appear. Hull 
assumes generally that stimulus traces 
are active for about 30 sec. Since the 


last syllable will appear 24 sec. after 
the X, the trace is not likely to be very 
strong and the last remote association 


should be correspondingly weak. 
Similarly for every other syllable in 
the list: each syllable serves as a 
stimulus, giving rise to a stimulus 
trace of diminishing intensity result- 
ing in a negatively accelerated gradi- 
ent of remote associative strengths. 
In a serial list each successive stimu- 
lus syllable has fewer possible remote 
associations. This results in the situ- 
ation that the last remote association 
to the first stimulus is going to be rel- 
atively weaker than the last remote 
association to the second stimulus and 
so on through the list. If we were to 
assign an arbitrary value of one to the 
most remote association possible in the 
list (that between the starting signal 
X and the last response), then all less 
remote associations should be valued 
more highly. Since there is no pres- 
ent basis for assigning exact strength 
values to the remote associations the 
writer suggests the arbitrary and 
highly speculative values indicated in 
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the table of Weighting Values in 
Table V, where a vaiue has been 
assigned to every possible remote 
association in an eight syllable list. 
An increment of one strength unit is 
added to each less remote association 
starting with the association between 
the first stimulus and the last re- 
sponse. A system of negatively ac- 
celerated values would be more ap- 
propriate and would result in pro- 
portionately higher values for the less 
remote associations, but such a system 
is both impractical and unnecessary 
for the present demonstration. 

The above description does not 
account for the progressively higher 
values assigned to the associations of 
the same degree of remoteness. Thus, 
the associations between Stimulus 1 
and Response 3 and Stimulus 6 and 
Response 8 are both of one degree of 
remoteness and are not equally valued. 
The greater value in the latter case is 
assigned on the basis of Postulate II 
which deals with competition between 
reaction potentials. In the present 
situation of an eight syllable list, it is 
assumed that when the stimulus trace 
from Stimulus 6 is initiated, there will 
be competition for associations from 
Responses 6, 7, and 8, the stimulus 
energy being shared in some manner, 
with the result that none of the associ- 
ations will be as strong as they would 
be if there were no competition. 
When the first stimulus is present, 
there will be competition for associ- 
ation from all eight possible responses. 
If we assume that each stimulus has 
the same potential trace strength, 
then it is reasonable to infer that as- 
sociations made in competition with a 
large number of others are likely to be 
less strong than those made under 
circumstances of little competition. 
This justifies us in assigning a higher 
value to associations of the same de- 
gree of remoteness if they occur later 
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in a list because there is less competi- 
tion possible toward the end. Ac- 
cordingly (reading in rows across the 
table of values) we arbitrarily assign 
an increment of one to each later first 
association. 

With such a system of values at 
hand it is possible to take account of 
both frequency and strength of remote 
associations in a serial list and, by 
plotting these, obtain a theoretical 
expectancy curve of the relative 
difficulties of learning at the several 
points in a serial list, when such diffi- 
culties are measured by the frequency 
of remote associations. Comparison 
of such a theoretical curve with actual 
data should permit some test of the 
postulates involved. In the present 
experiment such a comparison is at- 
tempted along with an additional 
attempt to demonstrate the negative 
gradient in the stimulus trace (Pos- 
tulate I). 


APPARATUS AND PROCEDURE 


The present experiment was instigated by a 


paper of Raskin and Cook (16). These writers 
challenged the validity of Hull’s Postulate I on 
the basis of data consisting of 95 remote associ- 
ations accumulated from 19 subjects tested for 
associative recall by a free association procedure 
after learning a list of eight syllables to a criterion 
of three perfect repetitions. Their basic finding 
was that remote associations diminished in fre- 
quency up to the third degree of remoteness, in- 
creasing thereafter. 

Raskin and Cook used a list of eight syllables 
allegedly equal in associative value taken from 
Hull’s list. Their list and the order of syllables 
in the list could be chosen hardly less fortunately. 
Three of the syllables begin with the letter Y 
and, as will be shown, tend to be difficult to learn; 
four of the syllables tend to be relatively easy to 
learn in the position in which they appeared and 
markedly affected the learning. It will be shown 
that there were gross differences in the ease of 
learning of the syllables, both, as such, and in 
terms of positions in which they appeared. 
Their use of an association test is considered un- 
suitable as such a procedure is useful only after 
learning instead of during learning when remote 
associations occur frequently. 
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In view of the above considerations it was felt 
that a fresh approach to the problem was called 
for inasmuch as the conclusions of Raskin and 
Cook challenged the earlier findings of Hall (2), 
Mitchell (14), and Lumley (11). The evidence 
for a remote association gradient is, at present, 
sketchy. Philip and Piexotto (1g) reported 
“a slight gradient for serial position which is 
more pronounced for the shorter than the longer 
series.” McGeoch’s (13) general review leaves 
the issue unresolved. In his summary of the 
evidence, McGeoch could not decide upon the 
question of how remote associations distribute 
themselves. The present study offers data on 
which a reasonably reliable conclusion can be 
based. 

Subjects.—128 college students from courses in 
Experimental Psychology were used. All of the 
subjects had previous practice in nonsense syl- 
lable learning and experience with memory 
drums. None of them was familiar with the 
nature of the problem under investigation. The 
subjects had never used the spelling out method 
in anticipation learning and were given practice 
in this technique with a list of masculine names 
until it was clear that they understood the pro- 
cedure. 

Procedure.—Each subject learned a list of 
eight nonsense syllables for eight trials. The 
syllables appeared in the aperture of a memory 
drum for three sec. There were nine sec. be- 
tween the end of one trial and the beginning of 
the next. A large X appeared in the memory 
drum as the cue for the first syllable. The sub- 
jects were instructed to respond to all stimuli by 
spelling out the letters of the next syllable if pos- 
sible but if that proved impossible to try to say 
something, even if incorrect, and even if the 
subject recognized his response as incorrect. If 
each subject responded to each stimulus, on each 
trial, he would have a record of 64 responses. 
The responses were recorded verbatim. 

Materials.—The syllables were taken directly 
from the list used by Raskin and Cook in order 
that direct comparisons could be made with their 
results. Because of the fact that position effects 
were being studied, it was considered necessary, 
if any adequate conclusions were to be drawn, 
for the syllables to be of equal difficulty. If a 
single syllable proved to be easier to learn than 
some others it would affect the possibility of 
demonstrating any specifically positional effects. 
If the middle syllable, for example, proved ex- 
ceptionally easy to learn, either as a response to 
its predecessor or because of some intrinsic 
feature, it might be found that the middle of the 
list is most easily learned. Raskin and Cook 
appear not to have taken this factor into con- 
sideration and acted upon the assumption of 
equal associative value. The present writer, 
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TABLE I 


ARRANGEMENT OF THE E1:cutr Syiiastes in E:cut Dirrerent Lists, Eacu List 
Contarntnc THe Same Syiiasies in Dirrerent Posrrions ano WITH 
Durrerent Precepinc anp Succeepinc SyLiasBies * 








KEX YIK NY ZAL 


YUT NUV 





YUT ZAL MIB KEX 


YIK NIJ 





ZAL NUV YUT YIK 


YEV KEX 





NIJ KEX YEV NUV 


ZAL MIB 





YIK YEV KEX YUT 


MIB ZAL 





MIB YUT NUV NIJ 


KEX YEV 





NUV MIB ZAL YEV 


NIJ YIK YUT 





YEV NIJ YIK MIB 














NUV YUT KEX 














* The procedure for developing such squares has been described in a previous paper by the writer 


(1). 


although harboring misgivings about their choice 
of syllables, decided to control such effects as far 
as possible by arranging the syllables in eight 
different lists so structured that each syllable 
occupied each position once and only once, and 


followed each of the other syllables once and only 


once in each list. This arrangement resulted in 
the Latin Square appearing in Table I. The 
list used by Raskin and Cook is List I. Each 
of the eight lists was learned by sixteen subjects. 


REsuULTs 


1. The equality of the syllables.— 
The first question requiring considera- 
tion is that of the alleged equality of 
the syllables. The number of times 
each syllable was given correctly in 
its proper order is listed in Table II. 
It is at once obvious that not only were 
the syllables unequal in difficulty but 
that such inequalities were of a gross 
nature. The syllable MIB for in- 
stance, was correctly anticipated 49 
percent of the time (total for all posi- 
tions), YIK was correctly anticipated 
only 21 percent of the time. MIB, 
thus, was learned more than twice as 
well as YIK (critical ratio = 13.5). 
The syllables are arranged in rank 
order of learning and the percentages 





of correct anticipations are indicated 
at the right of the table. Any differ- 
ence between percentages of six per- 
cent or more is significant at the one 
percent level. Examination of the 
table indicates that at every position 
in the list some syllables had an ad- 
vantage over the rest, some being 
given correctly only a few times at 
some positions, frequently at others. 
The use of any one list of syllables for 
purposes of studying position effects 
seems a completely untenable practice 
and serious doubts may be raised con- 
cerning any experiment using only 
single lists for such purposes. The 
balancing procedure followed in the 
present experiment is obviously not 
completely adequate. To be so, it 
would be necessary that each syllable 
have about the same learning score at 
each position even though the scores of 
the different positions varied for all of 
the syllables. The remainder of the 
analysis depends upon the extent to 
which the inequalities of the syllables 
are overcome by the averaging process 
of summing their values and treating 
the scores by position value alone. 
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2. The remote association gradient.— 
In Hull’s system, it is assumed that 
the strength of the stimulus trace 
diminishes with negative acceleration. 
Raskin and Cook interpreted this as- 
sumption to mean that remote asso- 
ciations would be less numerous with 
increasing separation between syl- 
lables in a list; thus, there should be 
relatively fewer responses indicating 
an association between Stimulus I and 
Response VIII than between I and 
III. The intervening syllables should 
occur as intrusions or remote associ- 
ations with diminishing frequency, in, 
according to Hull, a negatively ac- 
celerated fashion. The data bearing 
on this question appear in Table III. 
The number of times each syllable was 
given in place of the correct (direct) 
association is indicated by position 
scores; thus, instead of responding 
with the first syllable when X ap- 
peared in the memory drum, the sub- 
jects responded with the second syl- 
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lable in the list 27 times, with the 
third syllable 8 times, andsoon. The 
total number of such associations ap- 
pears at the right of the table along 
with the number of miscellaneous 
errors (one and two letter responses, 
unidentifiable errors, etc.) and the 
number of failures to respond. In 
order to evaluate the frequency of 
each degree of remoteness, the data of 
Table III are arranged in Table IV to 
indicate the number of remote forward 
and backward associations made to 
each stimulus. The backward associ- 
ations are included in the table for 
what they are worth, but will not be 
considered in the discussion because 
they were too infrequent to evaluate 
systematically. 

The remote forward associations to 
the first stimulus drop from a total of 
367 at the first order remote to 7 at 
the seventh order remote. It is, of 
course, impossible for any other stimu- 
lus syllable except the first, to have the 


TABLE II 
Tora Numper or Correct Responses ror Eacu Stimutus Sytiasie Listen Accorpinc 


To Tue Posrrion or THe SyYLiasBLe iN THE List. 


Each syllable occupied a different 


position in eight counterbalanced lists (see Table I). Totals are for eight 


learning trials for 128 subject. 


Syllables are listed at the left in 


their rank order of ease of learning. 
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TABLE Ill 


Totrat Numper or Correct ayy Remote Assocrations (Taree Letrers Correct, 
But tn Wronc Posrriow) Given tw Escutr Trrats sy 128 Supyects * 


Responses Listed by Position 
Remote Associations 





Total 


Total " Total 
Corr. > . | Forward 





697 53 
$3 82 


11t 


260 














10 6 6 384) 384 54 











* The table should be read from left to right, thus, to Stimulus 1, Response 1 was made twice, 
Response 2, 531 times, Response 3, 39 times, etc. All underlined numbers represent correct responses 
for the stimuli appearing on the drum, all other syllable responses are remote associations either 
forward or backward. Miscellaneous errors included responses of three or fewer letters which could 
not be accounted for exactly on the basis of the syllables used. Attempts to find any regularity 
among the two-letter correct remote associations proved fruitless. ‘No response’ refers to failure 
of the subject to say anything; if these are interpreted as ‘olocks’ some interesting speculations arise. 

** The appearance of X in the memory drum slot was the stimulus for Response 1. 
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Remote Forwarp anp Bacxwarp Associations (Turee Letrers Correct) Arrancep 

Accorpinc To Decree or Remoteness. Total for eights learning trials for 128 subjects. 

Note that the possible number of degrees varies for each position. (Backward associations 
are included for completeness but are not considered in the discussion.) 
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possibility of seven forward remote 
associations, and, in fact, each suc- 
ceeding syllable has one less remote 
forward association possible. A 
glance at Table IV shows that all but 
the last stimulus could offer the op- 
portunity of at least a first degree re- 
mote association. These first order 
remote associations appear in column 
1. All but two of the stimuli could 
have second degree remote associ- 
ations; these appear in column 2. 
With diminishing possibilities, the 
rest of the remote associations appear 
in succeeding columns. It is of course 
impossible simply to sum these associ- 
ations because of the preponderance 
of the early remote associations. In 


' Sam 


MEAN MUMBTR OF REMOTE ASSOCIATIONS 
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the present instance, there is the pos- 
sibility of, at least, averaging the num- 
ber of each degree of remote associ- 
ation by dividing their number by the 
number of syllables capable of having 
each degree of remoteness. This pro- 
cedure has the disadvantage that the 
averages have less and less meaning 
as the number of syllables decrease so 
that the last average is no average at 
all but is simply its own total. If the 
averages for the first few syllables can 
be considered to have any meaning at 
all and their values plotted, we have 
the negatively accelerated curve of 
Fig. 1. This curve corresponds with 
Hull’s assumption. With a longer 
list of syllables, more of the mean 








d 2 3 


Fic. 1. The average number of remote associations at each degree of remoteness. 
the text, each point is calculated on a progressively smaller number of possibilities of association. 
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values might be reasonably reliable 
and might prove more substantial 
support for the gradient assumption. 

3. Relative difficulty of learning at 
the different positions in the list.—The 
argument concerning the possible 
effects of remote associations in deter- 
mining ease of learning ratios has been 
presented in the introduction. 

It remains to see how nearly the 
actual data correspond to ratios of the 
arbitrary value system. The number 
of remote associations found between 
each combination of stimuli and re- 
sponses is indicated in Table III. 
From this table, the various types of 
remote associations have been trans- 
cribed to their corresponding positions 
in Table V (see column c). The 
totals of the columns in Table V pro- 
vide the basis for the three curves in 
Fig. 2. The first set of totals (column 


a) represents simply the number of 


® ® © 
16 B26 B2 
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remote associations spanning each 
direct association. This corresponds 
to Hull’s original (1935) counting pro- 
cedure. Since no associations are 
considered to be spanning the initial 
stimulus and the final response, these 
items have zero values so that the 
curve starts and ends with zero. The 
intervening values show 7, 12, 15, 16, 
15, 12, and 7 spanning associations. 
On the basis of this symmetrical dis- 
tribution, Hull originally deduced the 
greatest difficulty to be in the exact 
middle of a list. The second set of 
totals (column b) represents the sums 
of the arbitrary values assigned to 
each remote association. It is evident 
at once that these rise toward the 
middle and reach their peak beyond 
the middle before declining to the the- 
oretical zero. It is important to note 
that the symmetry is no longer pres- 
ent, that the later positions are higher 
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POSITION OF STIMULUS SYLLABLES IN THE LIST 


Fic. 2. 


Comparison of the empirical data with Hull’s 1935 deduction and with the prediction 


derived in the present study. Columns (a), (b) and (c) are ordinate scales for the three curves show- 
ing respectively Hull’s arrangement, the present theoretical arrangement and the actual data of the 
present experiment. The sizes of the steps for the three separate scales have been converted to 


occupy the same graphic range. 
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(more difficult) than the early posi- 
tions. Thus, not only is the greatest 
difficulty found beyond the middle, 
but the last and next to the last posi- 
tions are relatively more difficult than 
the first and second (a typical labora- 
tory finding). The sums of the third 
columns make up the third set of 
values plotted in Fig. 2. These sums 
like those of the second are asymmetri- 
cal, with the highest value beyond the 
middle. Because all three sets of 
sums contain numbers which are not 
directly comparable it is necessary to 
arrange three sets of corresponding 
ordinates for the common abscissa of 
Fig. 2 in order to compare the resulting 
curves. This procedure is simpler 
than trying to reduce all three sets of 
values to the same base and it serves 
the purpose of comparing the curves 
while leaving the data in meaningful 
form. A glance at the figures shows 
the empirical data to follow the values 
derived in the present effort much 
more closely than they do Hull’s values. 
The failures of the empirical and de- 
rived curves to be superimposed may 
be due to insufficiency of data or unre- 
liability of some of the points due to 
uncontrolled sources of error, or may 
be due to the procedure of assigning 
values to the remote associations. 
Since the procedure was artificial and 
arbitrary and followed a direct pro- 
gression instead of a theoretical nega- 
tive acceleration it is likely that a re- 
finement of the value system by use of 
the theoretical curves of negative ac- 
celeration might improve the fit. To 
use such theoretical values, however, 
would involve a rather precise knowl- 
edge of the nature of the theoretical 
curve corresponding to the waning 
strength of a stimulus trace and would 
be complicated by the intertrace 
effects about which knowledge is, at 
present, absent in the degree of pre- 
cision required. 
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Discussion 


The two-fold intent of this experi- 
ment was to provide an independent 
demonstration of the negative gradi- 
ent assumed by Hull to be character- 
istic of the stimulus trace and to ex- 
plain the relatively greater difficulty 
of learning the items just beyond the 
middle of a serial list of nonsense syl- 
lables. The first objective appears 
to be met by Fig. 1. The earlier neg- 
ative results of Raskin and Cook ap- 
pear to have been artifacts of their 
method and materials. The present 
findings suggest great caution in using 
single lists of syllables in studying 
serial position effects. With regard 
to the second objective, the general 
correspondence of our theoretical and 
empirical curves appears to support 
the reasoning outlined in the intro- 
ductory section of this paper. Since 
both Postulates I and II were involved 
in that reasoning, the present results 
tend to confirm both of the postulates. 

The present problem has been and 
remains theoretically important and 
has been critically considered by 
McGeoch (12) and more recently by 
Hilgard (3). The nature and fre- 
quency of remote association as meas- 
ured by anticipatory intrusions is of 
critical experimental interest in prob- 
lems of proactive and retroactive in- 
hibition and serial reinforcement. In 
Hull’s recent study of the latter prob- 
lem (9) he reported additional evi- 
dence that first order remote associ- 
ations are stronger than associations 
of greater remoteness. If we are 
justified in drawing an analogy be- 
tween maze learning in the rat and 
human rote learning, as Hull does, it 
is pertinent to examine the animal 
data reported by Hull. In Hull’s 
maze rats were forced to choose one 
passageway from among four at four 
successive stages in the maze. If we 
assume that anticipatory errors in the 
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maze correspond to remote associ- 
ations then we find that the animals 
(after a run through the maze) can 
err in an anticipatory fashion at the 
first, second and third choice points 
(but not at the last). At the first 
choice point they can make anticipa- 
tory errors of 1, 2, or 3 degrees of re- 
moteness; at the second they can 
make errors of only 1 or 2 degrees of 
remoteness, at the third, only errors of 
the first degree of remoteness. When 
the number of such anticipatory errors 
is averaged for the three degrees of 
remoteness we find the following 
means corresponding to the degrees of 
remoteness: (1) 23.2, (2) 17.2, (3) 
14.1. Plotting these means reveals at 
least the beginnings of a negatively 
accelerated curve, corresponding to 
the present results. More to the 


point in the present argument is the 
fact that as the animals move through 
the maze from choice point 1 through 


2 and 3 they make the following num- 
ber of first degree antic,patory errors: 
15.3, 22.5, and 31.8, bearing out the 
present claim that as less and less 
opportunity is available for more 
varied responses to become associated 
with stimulus traces, those responses 
which are associated are more strongly 
established. The further justification 
of the modification of the stimulus 
trace function as here presented must 
await future objective verification be- 
sides its apriori reasonableness and its 
possible fruitfulness in prediction. 
The later suggestions of Hull (7, 
9) offer different methods of deducing 
the theorem described in the present 
paper. The latest of these makes 
use of the ‘physiological summation’ 
of the separate gradients of reinforce- 
ment in a serial learning situation 
where each stage in a serial maze habit 
is separately reinforced. In this situ- 
ation Hull found that the first stage 
was learned faster than the last, with 
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the third choice most difficult. By 
applying the obtained gradient of 
serial reinforcement to the learning of 
nonsense syllables, Hull is able to show 
that the point just beyond the middle 
of a list would have a lower reaction 
potential than any other point. The 
deductions involved in this predic- 
tion make use of the concepts of serial 
reinforcement, anticipatory and per- 
severative errors, and the assumption 
that secondary reinforcements func- 
tion in nonsense syllable learning by 
the method of anticipation. Assum- 
ing the validity of the new deductions, 
we have, then, an additional explana- 
tion for the same general phenomenon 
of difficulty just beyond the middle of 
a serial list. There is no reason why 
a consistent system should not be able 
to provide several ‘proofs’ of the same 
general theorem if the theorem is sus- 
ceptible of statement in terms of 
different measures of behavior. In 
Hull’s new deduction the explanation 
depends primarily on measures of 
progress in learning or number of 
trials to learn. The present paper 
makes no reference to learning trials or 
cumulative effects of reinforcements; 
it concentrates instead on the total 
number of errors (remote associations) 
made in a given amount of practice. 
Assuming the validity of the present 
explanation, it appears that the pos- 
tulates of Hull’s system are comple- 
mentary to each other and not incon- 
sistent. 


Hull’s original explanation of the 
difficulty of learning in the middle of the 
list made use of the concept of ‘inhibition 
of delay,’ which presumably involved an 
interference with the emergence of re- 
mote associations as learning progressed. 
This paper is not concerned with such 
additional processes and sheds no light on 
them. The criticism directed against 
the notion that remote associations are 
trace conditionings on the ground that 
classical trace conditioning involves a 
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long and arduous procedure whereas re- 
mote associations are quickly made, was 
first brought up by Hilgard and Marquis 
(4). This same criticism is repeated 
almost verbatim by McGeoch (13). 
This objection to the stimulus trace 
principle as functioning in serial rote 
learning demands some comment, in view 
of the fact that the assumption has been 
made throughout this paper that remote 
and trace associations are identical. It 
should be pointed out that the difficulty 
experienced by Pavlov did not consist of 
setting up trace conditioning as such but 
in establishing trace conditioning to 
stimuli which were presented at con- 
siderable remoteness (in time) before 
the unconditioned stimulus. To condi- 
tion an animal to respond to a bell which 
sounded twenty minutes before, can 
indeed be difficult. To condition it to 
shorter time intervals is apparently not 
so difficult. The basic problem was to 
keep the response from occurring before 
the experimenter wanted it to. Failures 


or errors on the part of the subject con- 
sisted in the appearance of the response 


in an anticipatory fashion. Such antici- 
patory intrusions can be considered 
equivalent to remote associations which 
occur as intrusions in nonsense syllable 
learning. With no time interval at all 
(simultaneous presentation of CS and 
UNCS) conditioning is most rapid. The 
situation involved in nonsense syllable 
learning can hardly be considered trace 
conditioning in the traditional sense. In 
effect it amounts to simultaneous condi- 
tioning to the trace of a stimulus which 
(in an eight-syllable list, with three-sec. 
time intervals between syllables) need 
not precede its future associate by more 
than 24 sec. and may be no more than 
three sec. As a matter of fact, the as- 
sociation between the first stimulus and 
the last response involving, as it does, 
24 sec. of time is very poor in strength; 
whereas the association between the 
stimulus for the seventh response and 
the eighth response (with only one syl- 
lable intervening) is formed much more 
strongly. The gradient involved in such 
trace associations, in fact, is the entire 
basis of the present account. The argu- 


ment that trace associations are difficult 
to form appears to have developed from 
a too hasty acceptance of an analogy 
first used by Hull to illustrate the trace 
concept. 


SumMARY AND CoNncCLUSIONS 


1. The present experiment was 
designed to answer two questions: (1) 
do remote associations in serial rote 
learning follow a course of negative 
acceleration with degree of remoteness 
as required by Hull’s Postulate I?, 
and (2) can such negative acceleration 
account for the often reported finding 
that nonsense syllables just beyond 
the middle of a list are most difficult 
to learn? 

2. 128 experienced subjects learned 
eight nonsense syllables for eight trials 
by the anticipations method, spelling 
out responses which were recorded 
verbatim. 

3. Remote associations were found 
to decrease with remoteness in a nega- 
tively accelerated curve up to the 
sixth degree of remoteness; the 
seventh or last degree possible showed 
a slight upward turn. 

4. First degree remote associations 
were progressively more frequent to 
syllables appearing later in the list, 
suggesting the general conclusion that 
where less associations are possible, 
those that are formed will be stronger. 

5. Because of the different strength 
of associations of each degree at 
different positions in a list, it is possi- 
ble to weight the frequency of remote 
associations with a value representing 
their strength. Such weighted fre- 
quencies provide a theoretical curve 
which shows the greatest negative 
effect of remote associations in learn- 
ing to occur just beyond the middle 
of a list. 

6. The empirical frequencies of the 
present experiment correspond more 
closely to such a theoretical curve than 
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they do to the original curve deduced 
by Hull for accounting for the relative 
degrees of difficulty of syllables in a 
serial list. 

7. The concept of the stimulus 
trace as described by Hull is consistent 
in every way with the present findings 
which are interpreted as supporting 
this Postulate and as refuting certain 
practical and theoretical criticisms 
previously advanced. 


(Manuscript received June 17, 1949) 
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INTRODUCTION 


The present experiment is an out- 
growth of a previous one (1) in which 
the functions of vernier, stereoscopic, 
and real depth discriminations were 
evaluated under strictly comparable 
experimental conditions. In that 
study, it was found that stereoscopic 
and real depth discriminations were 
similarly affected by changes in spatial 
relations of the test objects, but that 
vernier discrimination was affected 
very differently by such changes. It 
was apparent from this finding that 
stereoscopic and real depth discrim- 
inations could not be interpreted as 
merely the combination of vernier dis- 
criminations performed by each sep- 
arate eye. 

In the present experiment, vernier 
and real depth discriminations are 
again evaluated under comparable 
conditions. The experimental vari- 
able in this case is that of brightness of 
field. An important feature of the 
experiment is that field brightness was 
varied without interposing any optical 
apparatus in the line of regard from the 
eyes to the test objects. In this way, 
it was possible to explore the influence 
of brightness uncomplicated by the 
presence of other factors. 


1These experiments were performed in the 
Psychological Laboratory at Brown University. 


The relation of certain forms of 
visual acuity or resolution to bright- 
ness of field is well known (see, for 
example, the articles of Shlaer (6), 
Hecht and Mintz (4), and Byram (2, 
3)). There is no a priori reason to 
suppose, however, that vernier or real 
depth discriminations will be affected 
in the same manner as other types of 
visual discriminations by changes in 
levels of brightness. The experiments 
to be reported here lead to the con- 
clusion that at the lowest brightness 
levels both vernier and depth dis- 
criminations are indeed limited by a 
failure to resolve the test objects, but 
that at higher levels other factors 
become increasingly important. 

Mueller and Lloyd (5) have recently 
reported an experiment on stereo- 
scopic acuity as influenced by level of 
illumination. Using a relatively thick 
(20 min. of visual angle) line as a tar- 
get, they were able to explore stereo- 
scopic acuity at levels below the region 
over which cone vision predominates. 
They interpret their results in terms of 
visual acuity theory. 


APPARATUS 


The general form of the apparatus used in the 
present experiment has been described elsewhere 
(1). Only its essential features will be described 
here, together with a description of modifications 
which were made to secure the necessary range 


349 


Fase Lp igi tt tba tecd SI A PIT wn, . 














R. N. BERRY, L. A. RIGGS, AND C. P. DUNCAN 











Fic. 1. 
situation. 


solid lines represent the principal visual paths. 


of field brightnesses. Fig. 1 is a diagrammatic 
representation of the general visual situation 
which existed for both the vernier and real depth 
situations. 

The observer was seated at point O and the 
test object was placed at point X. The sub- 
jects’ eyes, E; and E, were 4622 mm. from the 
test object which was viewed by O through the 
interior of barrel B. A large aperture, with an 
attached hood of the Brewster stereoscope type, 
was cut in the end of the barrel nearest the sub- 
ject while the other end contained two circular 
apertures, C, and C,, in the horizontal plane. 
The diameters of each of these circular apertures 
subtended slightly less than 3° of visual arc. 
They were so arranged that the image of the test 
object appeared in the middle of the right and 
left apertures for the right and left eyes respec- 
tively. Both eyes viewed the test object through 
a rectangular aperture, common to both eyes, 
which was 450 mm. in front of the test object 
and subtended 1° 30’ in the vertical plane and 20’ 
in the horizontal plane. 

The intensity of the lights was controlled by 
the use of three sets of suitable filters, FFF, in 
front of light sources which illuminated various 
parts of the visual field. None of these light 
sources was visible to the subject. They are in- 
dicated by Pi, Ps, and Ps, in Fig. 1. As can 
readily be discerned from the figure, it was nec- 
essary to provide an illumination matching that 
of surface S; on surfaces S, and Ss. This was 
accomplished by the use of a half-reflecting 
mirror at M, which divided the beam from P, 
about equally between the two surfaces. There 
was also a light source in barrel B fitted with 
filters of the same type as those used in front of 
the other sources. A given brightness was first 
obtained in the barrel B for the region between 
apertures C, and C,. All other brightnesses 
were then matched visually to this one. The 


Diagrammatic representation of the vernier visual situation and the real depth visual 
The dashed lines represent the principal beams of light from the light sources. The long 


result of such matching was that the entire visual 
field appeared of a uniform brightness and color. 

The mirror shutter, M3, is shown at an angle 
of 45° from the median plane. This position of 
the shutter enabled the subject to view two 
fixation crosses on surface S,, and it also served 
to conceal the test object from view. When the 
shutter was swung out of the visual field by the 
activation of a solenoid attachment, the test 
object replaced the fixation crosses in the field, 
and it was seen against background Ss. Ms 
split the light beam from Ps in such a manner 
that S, and S, were both of the same brightness 
as the rest of the field. 

Test objects: Vernier: The vernier test object 
consisted of two blackened steel rods placed 
vertically one above the other at position X in 
Fig. 1. The test rods were 2.4 mm. in diameter 
and appeared as black vertical lines whose thick- 
ness subtended 107 sec. of arc. The vertical 
separation between them was held constant at a 
value of 3 mm. or 134 sec. of arc. Fig. 2C shows 
the appearance of the test rods as seen by the 
subject; Fig. 2B gives a top view of the vernier 
situation in which the upper rod, U, is fixed and 
the lower rod, L, is capable of being laterally 
displaced as shown. The displacement was 
regulated by means of a micrometer attachment. 

Real depth: This test object is so named to dis- 
tinguish it from a stereoscopic depth test object 
used in a previous study (1). The upper, 
stationary rod of the real depth test object was 
placed in the same position as the stationary rod 
of the vernier test object (at X in Fig. 1). The 
lower, variable rod of the real depth test object 
was movable in the median plane as indicated in 
Fig. 2A. Any lateral movement of the variable 
rod in the real depth test object was prevented by 
the action of a pair of hardened steel straight 
edges which permitted a maximum deviation of 
only 0.01 mm. (0.4 sec. of arc) from the pre- 





VERNIER AND DEPTH DISCRIMINATIONS 


eves 
A. veetx SITUATION B. VERNIER SITUATION 


Fic. 2. Schematic representation of the two 
visual situations employed in the present experi- 
ment. U and L represent the upper (stationary) 
and lower (movable) rods, respectively. Angles 
1 to 4 represent threshold displacements of the 
movable rod in terms of visual angle. 


scribed median plane. This deviation was far 
below the threshold for vernicr acuity. The 
same magnitude of error also existed in the 
vernier situation and the same consideration 
again applies. 


PROCEDURE 


General.—Three adult male subjects with a 
minimum visual acuity rating of 20/20 were 
employed in the present experiment. The only 
visual defect that was noted in any subject, on 
the basis of results from the Bausch and Lomb 
‘Ortho-rater,’ was an extreme exophoria in the 
case of L. A. R. The subjects ranged in age 
from 28 to 34 years. 

The psychophysical method of constant 
stimuli was used in all determinations of thresh- 
olds. For each subject a coded order of stimu- 
lus presentations, utilizing five positions of the 
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variable rod, was made at the beginning of the 
experiment. This order was used in construct- 
ing all experimental! series for any given subject. 
One hundred presentations constituted a com- 
plete daily series, and two such series were ob- 
tained for the computation of a single threshold. 

A ‘forced choice’ procedure was used through- 
out the experiment. In the real depth situation 
the response was cither ‘back’ or ‘front.’ In the 
vernier situation the response was either ‘right’ 
or ‘left.’ 

On entering the experimental room each sub- 
ject was given adequate time to adapt to the 
brightness of the field being used that day. The 
subject was then instructed to fixate on the 
region between the two black crosses appearing 
in the median plane; fixation was made in the 
area between the crosses so that troublesome 
after images might be avoided. The stimulus 
series was then begun by presenting the test 
object for four sec. and the crosses for the suc- 
ceeding 11 sec. After the 11-sec. interval the 
test object was again presented. This schedule, 
with the exception of five short rest intervals, 
was followed until the series was completed. In 
the real depth situation the subject was supplied 
with a biting board which held the head in a 
fixed position during the time the test object 
was exposed. 

The parameter that was varied in both of the 
main experimental situations was the brightness of 
the visual field. Brightnesses were measured by 
a Macbeth illuminometer. Thresholds with the 
various brightnesses were obtained in a random 
order that was the same for both the real depth 
and the vernier situations. This order was 192, 
0.16, 4.8, 0.04, 0.90, ando.o7 millilamberts. Two 
experimental sessions were required for the 
determination of a threshold at each brightness 
level. Thus, each subject observed a total of 24 
sessions for the six brightnesses of vernier and 
realdepth. In the real depth situation one more 
brightness, 21.7 millilamberts, was added to the 
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THRESHOLD IN SECONDS OF ARC 








LOG MILLILAMBERTS 


Fic. 3. Thresholds for each of the three sub- 
jects for each of the brightnesses in the vernier 
situation 


above series, in the case of one subject, L. A. R. 
All the real depth thresholds were determined 
during the first half of the experiment, and all the 
vernier thresholds were determined during the 
second half.* 

The computation of thresholds was carried 
out in a traditional manner that has been de- 
scribed in detail previously (x). Also in ac- 
cordance with tradition, the magnitudes of 
vernier thresholds are expressed in terms of 
angle 3 (or angle 4, which equals angle 3) in Fig. 
2 B. Real depth thresholds refer, on the other 
hand, to the sum of angles 1 and 2 in Fig. 2A. 


REsULTs 


Table I presents the results in the 
form of thresholds computed in the 
manner outlined immediately above. 
Figs. 3 and 4 present the same data in 
graphical form. It is apparent that 
both vernier and depth discrimination 
suffer markedly with a reduction of 
field brightness to a level of one milli- 
lambert or lower. At a brightness 
slightly below .o4 millilambert the 
test objects are no longer resolved and 


2 The necessity for this schedule was dictated 
by the extreme difficulty of lining up the appara- 
tus to the required standards of precision. In an 
earlier experiment (1) it was shown that no im- 
portant effects due to practice or fatigue were to 
be expected with this arrangement of the experi- 
ment when experienced subjects were employed. 


all forms of discrimination deteriorate. 
Above 10 millilamberts, however, 


there appears to be no considerable 
improvement in vernier or depth dis- 
crimination with further increases in 
the level of brightness. 


Discussion 


A comparison of the vernier and 
depth functions appears to show that 
under the present experimental condi- 
tions the depth function is much more 
affected by brightness changes than is 
the vernier. It must be borne in 
mind, however, that each of the depth 
thresholds of Fig. 4 represents the 
sum of the two angles for the separate 
eyes. On the assumption that the 
retinal resolution of the individual eye 
is the essential factor limiting perform- 
ance at the lowest levels of brightness, 
it might be expected that depth thresh- 


is 


THRESHOLD IN SECONDS OF ARC 








=| ie) I 2 
LOG MILLILAMBERTS 
Fic. 4. Thresholds for each of the three sub- 


jects for each of the brightnesses in the real 
depth situation 
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olds as measured here would be twice 
as great as vernier thresholds. 

The above expectation is realized in 
the data of Figs. 3 and 4. The point 
is made clear by replotting the data as 
the curves of Fig. 5, where the mean 
thresholds are all in terms of the 
angle measured at the individual eye 
(namely 3 in the vernier situation of 
Fig. 2B, or 1 in the depth situation of 
Fig. 2A). When drawn in this man- 
ner the curves for the two functions 
are seen to overlap one another at the 
lowest levels of brightness and to di- 
verge from one another as the bright- 
ness increases. 

The present results bear some super- 
ficial resemblance to those of Mueller 
and Lloyd (5) in that our curve for 
‘real depth’ resembles in shape the 
lower (presumably cone-receptor) por- 
tion of their Fig. 1 for stereoscopic 
acuity. We have refrained from the 
fitting of any theoretical equation to 
either of our curves, however, in the 
absence of any indication of which, if 
either, of these functions can be han- 
dled in that fashion? Our Fig. 5 
does suggest that resolution by the in- 
dividual eye is the limiting factor at 
low levels of brightness. 

It is of interest to note that the 
thresholds reported by Mueller and 
Lloyd for stereoscopic acuity at high 
levels of brightness are considerably 
above those of recent investigators 
(see Walls, 7, p. 498). A threshold in 
the neighborhood of two sec. has com- 
monly been reported, while none of 
the thresholds of Mueller and Lloyd 
is below eight sec. A possible expla- 
nation of the discrepancy lies in their 
use of relatively thick lines; but evi- 
dence recently obtained would seem 
to show that this factor is not an im- 


* Walls (7) has pointed out that vernier and 
stereoscopic ‘acuities’ have little in common with 
visual acuity as measured by a line or grating 
test objects. 
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THRESHOLDS IN SECONDS OF ARC 








Fic. 5. Mean thresholds in terms of the 
angle measured at the individual eye; this does 
not affect the vernier thresholds of Fig. 3 but 
divides by a factor of two the real depth thresh- 
olds of Fig. 4. Curve A is for the mean vernier 
thresholds; Curve B is for the mean depth 
thresholds. 


portant one. Another possibility 
might be that the ‘bracketing’ pro- 
cedure which they employed would 
yield higher thresholds than the 
method of constant stimuli. We 
have recently obtained evidence on 
this point also, showing that the 
former procedure yields thresholds 
which are only slightly greater than 
the latter. Perhaps the discrepancy 
can only be attributed to differences 
in subjects or apparatus used in each 
of the investigations. 

The present experiment leaves un- 
answered the question as to how two 
subliminal vernier displacements for 
the individual eyes can be combined to 
give a discriminable displacement in 
depth at moderate to high levels of 
brightness. 


SUMMARY 


This study is an attempt to meas- 
ure vernier and real depth thresholds 
under various brightnesses of the visual 
field. The test objects were two 
vertical steel rods placed one above 
the other with a three-mm. vertical 
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separation between them. The top 
rod was stationary in both the vernier 
and real depth situations. The bot- 
tom rod was adjustable in the frontal 
plane in the vernier situation; in the 
median plane in the real depth situ- 
ation. The stationary rod was 4622 
mm. from the subjects’ eyes in both 
situations. A two-category constant 
stimulus method was used as the psy- 
chophysical method, with each thresh- 
old based on 200 responses. There 
were six brightnesses of field used in 
both the vernier and real depth situ- 
ations. The brightnesses, measured 


in millilamberts, were: 0.04, 0.07, 
0.16, 0.90, 4.8, 192. 
adult male subjects. 


There were three 


1. Comparison of the vernier and 
real depth thresholds indicates that 
depth discrimination is more influ- 
enced by brightness changes than is 
vernier discrimination. 

2. There is a regular curvilinear 
change in threshold values with in- 
creasing brightnesses in both vernier 
and real depth discrimination. The 
decrease in threshold values is most 
marked from 0.04 to 10 millilamberts. 

3. It can be postulated on the basis 
of the data that at the lowest illumin- 
ations the essential factor in both 
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vernier and real depth discrimination 
is the retinal resolution of the single 
eye. At the higher illuminations 
differential factors may be assumed to 
be operating in the two types of dis- 
crimination. 


(Manuscript received June 15, 1949) 
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VISUAL ‘NORMALIZATION’ NEAR THE 
VERTICAL AND HORIZONTAL 


BY W. C. H. PRENTICE AND DAVID C. BEARDSLEE 
Swarthmore College 


Some years ago, Gibson (3, 4) re- 
ported a series of investigations con- 
cerned with the effects of prolonged 
inspection of lines that were slightly 
tilted from the vertical or the horizon- 
tal. He was able to show two effects 
of such inspection. First, the tilted 
line tended, during the inspection 
period, to show some ‘normalization’; 
“that is, it was seen as becoming more 
nearly vertical (or horizontal as the 
case might be). Second, Os showed 


‘adaptation’ such that objectively 
vertical (or horizontal) lines appeared, 
in the post-inspection period, to tilt 
away from the original line. 

These effects were discussed by 
Gibson in terms of a form of adapta- 


tion much like the adaptations that 
occur to different levels of visual 
brightness and hue. The after-effect 
can thus be explained as a matter of 
some change in the state of the visual 
system, such that new vertical or 
horizontal norms have been estab- 
lished temporarily by a basic tendency 
on the part of the organism to see 
lines as lying in the main directions of 
visual space. 

If one thinks of the entire visual 
framework as being reoriented with 
respect to retinal coordinates, how- 
ever, Gibson’s findings offer one 
difficulty. For the after-effect, as 
measured, turns out to be a highly 
localized affair;! only those new lines 
placed in close vicinity of the inspec- 

1 Gibson maintained at one time (3, 463-464) 
that judgments of the horizontal were affected 
by adaptation to a new vertical. We have been 
unable to confirm this finding (see below) and 
find it inconsistent with other aspects of the 


phenomenon, e.g., with what Gibson (4) calls 
the “areal restriction of the after-effect.” 


tion line seem to be affected. In their 
1944 monograph, Kohler and Wallach 
(6) presented a new theoretical ac- 
count of Gibson’s findings. They 
had found that long inspection of 
visual contours gave rise to a condi- 
tion they call satiation, one of whose 
consequences is a displacement of new 
contours presented in the affected 
(satiated) portion of the visual field. 
Take, for example, the situation shown 
in Fig. 1. Here O fixates the cross for 
a minute or two while the large rec- 
tangle of Fig. 1a is present. He then 
looks at the fixation point in Fig. 1b, 
and when he does, he sees the right- 
hand square as lying below the left- 
hand one, although they are objec- 
tively on the same level and look so 
before any satiation has occurred. 
Kohler and Wallach argued that the 
Gibson-effect could be accounted for 
in terms of satiation and was only a 
special case of their more general con- 
dition. In Fig. 2, their explanation 
of Gibson’s after-effect is diagrammed. 
If their explanation is correct, they 
argued, the apparent angular change 
experienced as an after-effect should 
occur as well with a vertical inspec- 
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Fic. 1. When O looks for a prolonged period 
(one to three minutes) at the fixation point in 1a 
and then looks at the corresponding point in 1b, 
he sees the right hand square lying at a lower 
level than the left one. 
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Fic. 2. The broken line marked 7 is the 
inspection-line, that is, O looks first at it until 
some satiation has occurred. He then looks at 
the test-line 7. The dotted line marked 4 
represents the apparent orientation of T when O 
looks at it after having first stared at J. In 2a 
the typica! Gibson experiment is represented; in 
2b, Kohler and Wallach’s critical modification is 
shown (see text). 


tion-line and a tilted test-line; this 
prediction was confirmed (see Fig. 
2b). 

There is, however, one aspect of 
Gibson’s discovery that does not fall 
immediately into the framework of 
satiation-theory. Gibson found that 
the original tilted inspection-line ap- 
pears to right itself during the period 
of inspection; that ‘normalization,’ 
which was essential to his theory of 
the after-effect, plays no role in the 
Kohler-Wallach theory and, in fact, 
is not deducible from anything in the 
satiation-theory as presented in 1944. 

For these reasons, we became con- 
vinced that, while ‘adaptation’ as 
measured by Gibson might be only a 
special case of figural after-effect, 
‘normalization’ might be an independ- 
ent phenomenon. Some test for this 
‘normalization’ should be constructed, 
we decided, using conditions that 
would rule out any possibility of 
figural-after-effects. The basic ar- 
rangement for such a test is fairly 
obvious. O must be exposed to a 


tilted line in one part of the visual 
field for a period long enough to give 
rise to normalization (if any) and 
then allowed to compare the appear- 
ance of that line to that of another 
lying in some unaffected portion of 
the field. By varying the angle of 
tilt of the new and unaffected line until 
it appears parallel to the one that has 
had time to normalize, it should be 
possible to measure the amount of 
righting or normalizing. 

Actually, this test is not quite so 
simple as it seems at first. For there 
is another rather well known phe- 
nomenon that threatens to complicate 
the issue. Many years ago, Werthei- 
mer (8) reported that Os who look into 
a tilted mirror and see a tipped repre- 
sentation of a room notice in time 
that the room rights itself and that 
non-vertical lines become phenom- 
enally vertical. It is now apparent, 
as a result of the researches of others 
including Kohler and Wallach (6, 311-— 
312) and Asch and Witkin (2), that 
this phenomenon occurs maximally 
when O’s entire visual field is tilted 
and is greatly reduced (sometimes 
completely destroyed) when other 
frames of reference are visible. 

Here is an experimental dilemma. 
If all vertical and horizontal lines are 
removed from O’s visual field except 
the ones in which we are interested, 
any ‘normalization’ might be inter- 
preted as a Wertheimer- rather than 
as a Gibson-effect. That the two 
must be considered different is clear 
from the fact that Gibson’s phenom- 
enon is localized in visual space and 
Wertheimer’s is not and also from the 
fact that Asch and Witkin report the 
distortion of the visual space in an 
entire tilted room to be immediate, 
although continuing to increase as 
time passes. 

If, on the other hand, we leave some 
strong vertical-horizontal structure in 
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Fic. 3. When a pair of parallel lines is pre- 
sented and another line lies between them at a 
slight angle, long inspection will reveal a tend- 
ency for the tipped line to become more nearly 
parallel to the others. That is because the 
forces b and ¢ are stronger (since the effective 
contours are nearer) than those at a and d. 


the visual field, positive Gibsonian re- 
sults might be assigned to figural after- 
effects, since satiation is known to give 
rise to tendencies for tilted lines to 
appear more nearly parallel to fram- 
ing lines (see Fig. 3). The dilemma 
was faced in various ways that will 
become clear in the following pages. 

Briefly, the figural after-effect of 
Kohler and Wallach was supposed by 
them to be the prototype of Gibson’s 
phenomenon, but ‘normalization’ was 
unaccounted for by that supposition, 
and we therefore tested the possibility 
that it might occur even when condi- 
tions like those of Kohler and Wallach 
were eliminated or controlled. 


EXPERIMENTAL CONDITIONS 


Apparatus.—The apparatus used in this 
series of experiments consisted of a large screen 
made of white laboratory cardboard 28 X 22 
in., supported by clamps on a table 30 in. high. 
A fixation point was drawn at the center of the 
screen, and two square openings were cut at 
equal distances to the right and left of the fixa- 
tion point. In Experiment 1, these squares 
were 4 in. on a side, and the nearest edge was 
2} in. from the fixation point; in Experiments 2 
and 3, the square openings were enlarged to 8 in. 
on a side (see text) with the result that the fixa- 
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tion point was then only } in. from each border, 
while all other dimensions remained the same. 

Behind each opening a white cardboard disk 
containing a three-inch line in black india ink 
was supported in such a way that E could rotate 
the line about its mid-point which coincided with 
the center of the square opening in the screen. 
A fixed pointer and a protractor attached to the 
disk made it possible to read the angular position 
of the line accurately to less than one-half degree. 

Cardboard flaps cut from the same stock as 
the screen were hinged to the screen so that either 
window could be covered at will. 

Minor modifications of this apparatus in sub- 
sequent experiments are described in the text. 

Procedure.—O was seated about three meters 
in front of the screen in such a way that the 
fixation point was at convenient eye-level. He 
was instructed informally that his task was to 
compare the tilts of the two lines while looking at 
the fixation point and that he must always report 
whether a newly exposed line was more or less 
tilted than one that had been continuously ex- 
posed. In his first sitting, he was given some 
practice in making this sort of judgment until 
it was clear that he understood the instructions 
and could maintain fixation while making the 
discrimination. A method of constant stimuli 
with two categories was used throughout. E set 
the variable line in each new position by rotating 
the disk while it was covered by the flap. 

In each experimental session, one window was 
chosen for presentation of the standard, and the 
other was kept closed except for the times when 
O was called upon to make a judgment. O 
fixated the center of the card for 90 sec., after 
which E exposed the variable line in the other 
window. O reported whether the new one 
looked more or less tilted than the standard, the 
flap was closed, and O continued to look at the 
fixation point for 30 sec. before the next exposure. 
Further judgments were made every 30.sec. for 
seven to ten min. or until O reported fatigue. 
After a 10-min. recess, observations were con- 
tinued until the judgments (typically 35 in all— 
see below) had been completed. Ninety sec. 
fixation preceded the first judgment after the 
recess as in the case of the first series 

In order to complete the range from 100 per- 
cent to zero judgments of ‘more tilted,’ it was 
typically necessary to extend testing over a total 
range of seven degrees in one-~<legree steps. An 
occasional session could be completed success- 
fully with a smaller range, and one or two re- 
quired slightly larger ones. Each O made, in 
any case, five judgments concerning each position 
of the variable, and the PSE’s were calculated 
from these data by Spearman’s method (s, pp. 
169-171). 


In all experiments to be reported, the objec- 
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tive tilt of the standard was 10 degrees (from the 
horizontal in Experiments 1 and 2, from the 
vertical in all others). 

In Experiments 1 and 2, the direction of tilt 
(clockwise or counter-clockwise) and the position 
of the standard were varied systematically as 
follows: 


R and L indicate right or left hand window as the 
location of the continuously present line; cl and 
co indicate respectively clockwise and counter- 
clockwise tilts away from true horizontal. 

Each O had four sessions, one with each condi- 
tion, spaced exactly one week apart. The long 
separation between the determinations of PSE’s 
for the different conditions was used as a pre- 
caution against possible interference with meas- 
urement by persistence of previously established 
after-effects. 


Resutts 


Experiment 1.—In the first experi- 
ment, we investigated the behavior of 
lines that are tilted slightly away 
from the horizontal. Four Os ob- 
served the procedure described above 
with the apparatus having four-inch 
square frames around the stimulus 
lines, each O using the four standard 
conditions in different order and being 
tested one day per week for four 
weeks. The complete results are 
given in Table I. 


TABLE I 
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TABLE II 
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What the results in this table mean 
is that Os tended, on the average, to 
equate the standard which was actu- 
ally 10 degrees off horizontal with a 
variable that was only eight degrees 
off. The exact size of this constant 
error seems to vary some from condi- 
tion to condition with these Os and 
from one O to another, but there can 
be no doubt of the reliability or of the 
general magnitude of the effect itself. 
Under the conditions of Experiment 
I, namely with four-in. black frames 
around the lines, a result consonant 
with Gibson’s views is obtained. 

Experiment 2.—The heavy black 
frames only four in. from the test lines 
had been used in order to prevent any 
general disorientation of O with re- 
spect to the main directions of space 
while fixating, but it might have been 
argued that there was still room for a 
figural after-effect resulting from those 
frames despite the fact that the dis- 
tances were larger than those under 
which Kohler and Wallach found 
effects of significant size. For that 
reason, the conditions of Experiment 
1 were duplicated in Experiment 2 
except for one factor; in the new 
set-up, the framing squares were now 
twice the original size, or eight in. on 
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aside. If the original effect had been 
a genuine figural after-effect, it should 
be diminished by removing the fram- 
ing lines to twice the original distance. 
This experiment was carried out with 
the same four Os as Experiment 1, 
and all other conditions were the same. 
The results appear in Table II. This 
table shows that our Os are more 
variable from condition to condition 
in the new experiment than they were 
before, but that the general tendency 
and the mean magnitude of the effect 
are unchanged.* 

Experiment 3.—The third experi- 
ment was designed to test whether 
Gibson’s phenomenon could be re- 
produced at the vertical as well as at 
the horizontal. Except for the change 
from horizontal to vertical, all other 
conditions were maintained. Eight- 
in. frames were used dround the criti- 
cal lines; five new Os were used; each 
O observed only two conditions, viz., 
‘Left-clockwise’ and ‘Right-counter- 
clockwise,’ since we were not here 
concerned with interactions between 
position and direction of rotation, 
but merely wanted to give both posi- 
tions and both directions of rotation, 
in order that any average effect could 
not be assigned to a positional bias or 
‘space error.’ The results are dis- 
played in Table III. 

In this experiment, individual dif- 
ferences are greater than in any previ- 
ous version. This fact arises ia part 
from the fact that the Os were un- 
practiced. Only one had had any 


*An interesting sidelight is the continued 
tendency for the ‘Right-clockwise’ and ‘left- 
counter-clockwise’ conditions to give consistently 
smaller errors than the other two conditions. 
Separate investigation of this phenomenon would 
carry us too far afield at the present time, but it 
is clear that these differences, if reliable, might 
have important theoretical consequences. It 
should be re-emphasized here that these condi- 
tions appeared in different serial positions 
within the total experimental sequence for 
each O. 
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TABLE Ill 


Numper or Decrees Constant Error tn THE 
Direction or THe Verticat Mave sy Os 
Tryinc To Marcu rae Trr or a Line 
Osjective.y 10-Decrees Orr THe 
VERTICAL 








Standard Positien 








18 














Grand Mean 2.12 





previous experience in psychophysical 
experiments. The internal variabili- 
ties of the series of judgments on which 
we based the PSE’s in the table are 
very much higher than in Experiments 
1 and 2, and the raw data show nu- 
merous inversions and much wider 
spreads between the zero and 100 per- 
cent points. The one experienced O 
gave both the largest and the most 
consistent results. In both condi- 
tions, she reported ‘more tilted’ 100 
percent of the time for a line that was 
only two degrees more tilted than the 
line for which she always said ‘less 
tilted.’ In other words, there was 
confusion only about one step in the 
series. Since this highly accurate set 
of judgments also gave us our largest 
deviations, since all but one of the 
ten separate measurements give re- 
sults in the expected direction, and 
since the grand mean in nearly identi- 
cal in magnitude to previous deter- 
minations, it seems reasonable to con- 
clude tentatively that the phenomena 
at the horizontal and at the vertical 
are not essentially different despite 
the great variability shown in Table 
Ill. 











360 


Experiments 4, 5, and 6.—The next 
findings we wish to record were based on 
direct qualitative comparisons rather 
than on experimental measurement. 
Two conditions were used. In one, the 
black outline windows were simply re- 
moved. If figural after-effects of any 
sort are involved in our earlier experi- 
mental conditions, they might conceiv- 
ably be maximal with eight-in. frames, 
and the fact that four-in. frames did not 
give any larger effects might then be ir- 
relevant. Five members of the labora- 
tory staff followed the usual satiation 
procedure and made one judgment each 
on the unframed lines, comparing the 
originally presented line to a newly ex- 
posed parallel one under the same view- 
ing conditions as those used in earlier 
experiments. All Os agreed that the 


new line was clearly more tilted; or, in 
other words, that the original line had 
tended to right itself during the satiation 
period. 

The logical possibility still existed that 
the vertical edges of the screen might be 
producing a figural after-effect. 


To test 
that contingency, we next cut the edges 
of the screen to parallel the standard line 
with its 10-degree tilt from the vertical, 
still leaving the standard and test lines 
in their same position relative to the 
fixation point and still omitting any 
framing lines of the sort used in Experi- 
ments 1-3. Again trained observers 
were able to agree unanimously and with 
conviction that the phenomenon was 
unchanged by this experimental vari- 
ation. One new finding does emerge 
from this technique. The edge of the 
chart is clearly visible, though somewhat 
peripheral; and while O continues to 
fixate on the cross, the tilted stimulus 
line appears to right itself with respect to 
the edge. That is, O originally sees the 
stimulus line and the edge of the chart as 
parallel and tilted away from the verti- 
cal; after a minute or so, he sees the'line 
as more nearly vertical than the edge. 
We are not yet certain what causes this 
inequality in normalization. Several 
possibilities are open: the edge forms a 
much less distinct contour in the visual 
field, it is considerably longer, and it is 
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more peripheral. We have no conclusive 
experimental evidence as yet. 

A further experiment tested whether 
tilted edges might induce rotation in ob- 
jectively vertical lines under our sati- 
ation-conditions. Everything was han- 
died as in the two immediately preceding 
sets of observations except that the 
standard and comparison lines were both 
objectively vertical while the edges of 
the card were cut at 10 degrees from the 
vertical. No O reported any difference 
between the continuously present and 
the newly exposed lines. 


These three sets of observations 
further substantiate the view that our 
stimulus-line is not being tilted away 
from its original position by ‘figure 
currents’ (or their after-effects) com- 
ing from nearby contours. It is 
difficult, however, to conceive of an 
explanation for the phenomena that 
we have demonstrated unless it is 
done in the general frame work of 
satiation theory. The most obvious 
explanation would be, for example, 
that tilted lines form ‘self-satiating’ 
figures because they lie in inhomogene- 
ously satiated fields. We know, that 
is, that the lower half of the visual 
field is more subject to experimentally 
produced effects of satiation. This is 
a kind of permanent inhomogeneity 
in the visual system, and it is con- 
ceivable that some top-to-bottom 
gradient in the visual nervous system 
could be responsible for distorting 
figure processes so that a line placed 
athwart such a gradient might be 
righted visually by being transformed 
into a process more nearly symmetri- 
cal in the brain field. 

If such a conception were adequate 
to account for our data, the vertical 
and horizontal that are effective in 
producing the phenomena would have 
to be an anatomical vertical and hori- 
zontal rather than psychological ones. 
That is, the effect should not appear 
if the lines fell on either vertical or 





VISUAL ‘NORMALIZATION’ 


horizontal meridians of the retina— 
even though phenomenally the lines 
remained tilted. 

Experiment 7.—This modification 
of the basic experimental situation was 
planned to test the above alternatives 
and determine whether the ‘normaliza- 
tion’ is in the direction of positions 
that are anatomically unique or psy- 
chologically unique. Os were first 
asked to fixate a cross in the center of 
a sheet of white cardboard having two 
vertical black stripes down its right 
and left hand edges. (This sheet was 
placed temporarily in front of the 
regular test chart and was viewed 
from the usual distance). When O 
had developed a sufficiently strong 
negative after-image of this pattern, 
he was asked to put his head in an 
adjustable head-rest and tilt his head 
and the support until the stripes of 
the after-image coincided with the 
tilted edges of the test-card which 


was now exposed by removing the 


striped pattern. All this was done 
while fixating the cross in the center 
of the test chart. When coincidence 
had been achieved, O locked the head- 
rest in place, holding his head firmly 
in the position where the vertical axis 
of his retina was now parallel to the 
10-degree test lines, and began the 
go-sec. satiation period as in Experi- 
ments 1-6. The test consisted of one 
judgment concerning the comparative 
tilts of the standard and test lines 
which were in all cases objectively 
parallel and (as an additional check 
on doubtful cases) one judgment 


TABLE IV 


Nouwser or Os Grvinc Responses ‘Morz 
Titep,’ ‘Less Tittep,’ anno ‘Equat’ 
Wuen Asxep To Report on RELATION 
Between Newry Exposep 10- 
Decree Line anp Satiatep One 
PARALLEL To It 
‘Lees’ ‘Equal’ 
15 10 
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about the relation between the stand- 
ard line and the edge of the chart (see 
above.) 

We used 60 completely naive Os 
after preliminary checks with sophisti- 
cated Os had given unanimous results 
like those with head erect. Table 
IV gives the results with naive Os. 
Of the 25 Os who reported ‘less’ or 
‘equal,’ 19 were asked to report on the 
relation between the standard line 
and the edge of the chart; 17 of them 
reported that the line was more nearly 
vertical, and two said they were paral- 
lel. In other words, at least 52 of the 
60 Os show some positive evidence for 
the Gibson-effect even when the head 
is tilted parallel to the line. Six Os 
were (through an error) not given the 
second form of the test. Only two 
Os clearly show no effect. 

This experiment, then, gives rea- 
sonably clear evidence that the ‘norm’ 
to which the tilted line returns is, at 
least in the case of the vertical, a psy- 
chological norm rather than an ana- 
tomical one. It is a vertical that is 
established by other sensory data 
rather than the retinal vertical that is 
effective. 

This psychological character of the 
norm suggests some general disorien- 
tation of the sort that is found in cases 
where the entire visual field has been 
tilted. Although our procedure rules 
out the possibility that a really general 
change has occurred (since the test 
line would also be affected in that 
case), we were concerned to discover 
whether any tendency might be 
found for more widely felt influences 
of the normalization. Gibson had 
reported (3, 463-464) that judgments 
of the horizontal were affected by 
normalization at the vertical; so we 
constructed the following experi- 
mental arrangement. 


Experiment 8.—O sat at the same dis- 
tance from the screen and observed the 
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TABLE V 


Number or Decrees Constant Error 
Counter-cLocxwist Mave sy Os 
Tryinc to Matcn tue VeRrticaL 

Arter Satiation on A Line 
Titen CLocewisr From 
Horrzontar 


Position of Standard 
L R 


—.15 “15 
—.75 .00 


same experimental arrangement used in 
the measurement experiments, 1-3. In 
this case, the standard was set at true 
vertical. Between judgments, satiation 
was established by looking at the fixation 
point on another chart about 60 degrees 
away from the standard test chart. The 


satiation or inspection chart contained a 
fixation point and a three-inch line tilted 
ten degrees away from the horizontal. 
The tilted line had its center at the same 
visual angle from the fixation point as 
the centers of the lines on the original 
chart are from its fixation point (44 in. 


at 3 meters). 

O fixated the inspection chart for 90 
sec., then turned his head through about 
60 degrees tu look at the standard test 
chart where he saw two vertical lines, 
a standard objective vertical on (e.g.) 
the left and a variable on the right. He 
made a judgment as to the direction of 
tilt of the variable while fixating the cen- 
ter of the chart and then returned to in- 
spection of the card containing the line 
tilted ten degrees off the horizontal. 
Measurements proceeded as in Experi- 
ments I-3. 

The authors served alternately as Os 
for this experiment. Only two condi- 
tions were used: standard on the left, 
and standard on the right. The direc- 
tion of tilt of the inspection-line was in 
all cases clockwise. The accuracy of 
judgment was such that the procedure 
had immediately to be modified to use 
4-degree steps in the method of constants 
instead of the one-degree step that had 
been standard in other experiments. 
The constant errors are reproduced in 
Table V. 
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It is clear that such deviations from 
zero as occur are not reliable. They 
happen, besides, to be predominantly 
in the direction opposite to that pre- 
dicted from Gibson’s findings. Since 
these same two Os had shown consist- 
ently positive results in Experiments 
1 and 2, we conclude that no influence 
on discrimination of tiit near the 
vertical appears under our conditions 
as a result of normalization at the 
horizontal. 


Discussion 


We now have three superficially 
similar but functionally separable phe- 
nomena that bear on the general 
anisotropy of visual space. 

There is, first, the general re-orienta- 
tion phenomenon that Wertheimer 
(8, 257 ff.) reported in his observations 
with the tilted mirror and that Asch 
and Witkin (1, 2,9) have recently in- 
vestigated with considerable thorough- 
ness. This effect is (at least some- 
times) immediate; it does not require 
prolonged exposure to the tilted room 
in the Asch-Witkin set-up for O to 
show a distorted perception of the 
vertical. The effect is also general; 
that is, it involves the entire percep- 
tual field and is not localized in the 
region of certain stimulus lines. And, 
corollary to that fact, O need not be 
required to fixate any particular part 
of the field. 

Second, there is the figural after- 
effect of Kohler and Wallach. This 
effect can produce the phenomenal 
righting of objectively tilted lines as 
those authors have shown, but it re- 
quires that the to-be-righted line fall 
in a part of the visual field previously 
occupied by some one of certain figures 
with specified properties. It requires 
fixation, is sharply localized in the 
visual field, develops only over fairly 
long exposure-times, and is at present 
explained by a theory about the 
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physics of the primary projection 
areas. 

Third, we have the Gibson-effect 
which we have been investigating 
here. In many ways this phenom- 
enon looks like a figural after-effect. 
It takes time to establish, and it is 
localized in the visual field; the order 
of magnitude is approximately that 
of the figural after-effect; it does not 
require any general disorientation of 
the O. And yet it differs from the 
figural after-effects that have been 
investigated to date in important 
ways. It is not dependent upon the 
present or previous existence of other 
contours in the appropriate part of 
the visual field; in fact, we have been 
unable to find any influence whatso- 
ever from the extraneous lines and 
figures that did appear in our condi- 
tions. It is also not a matter of what 


has been called ‘permanent satiation’ 
of the primary visual areas (although 


it looks very much like what is known 
of such permanent satiation) since the 
critical orientation is not retinal but 
phenomenal. It also differs in one 
respect from what Gibson reported in 
his early papers; we have been unable 
to find that vertical lines are affected 
by the ‘normalization’ of the horizon- 
tal, even in the same general part of 
the visual field. 


It does not seem possible at this time 
to fit all these phenomena into a single 
theory. Yet, in a long view, it appears 
that we must eventually be able to con- 
struct a unified theory that will encom- 
pass them all, if our science is not to be- 
come hopelessly complex. Our present 
expectation is that a basis for such a 
theory will be found in the context of the 
figural after-effect and hypotheses that 
purport to explain it. There are two 
reasons for such an expectation. In the 
first place, the present phenomenon is 
precisely like the figural after-effects in 
every way except that we have been 
unable to discover a figural cause for it. 
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In the second place, no serious alterna- 
tive has been presented. The contention 
that such facts can be accounted for by 
learning theory (e.g., 7) is untenable. 
No present learning theory can possibly 
account for either our data or those of 
Koéhler and Wallach. Explanations in 
terms of judgmental errors due to dis- 
orientation are not only inappropriate 
to our situation, since only a single part 
of the deld is affected, but do not advance 
us in any case since they do not provide 
much more than a label for the phenom- 
enon. Even Gibson’s own explanation 
merely epitomizes the data; it empha- 
sizes the special character of the vertical 
and horizontal in human visual space 
and the tendency for deviations from 
these ‘norms’ to serve as stimuli to which 
the visual system may adapt; but it in 
no way explains why the vertical and 
horizontal should have special properties 
not possessed by other positions in space. 
That explanation must presumably come 
from an understanding either of congeni- 
tal anatomico-physiological features of 
parts of the visual nervous system con- 
cerned with the vertical and horizontal 
or of developmental considerations that 
come to produce inhomogencities in the 
visual system. Those developmental 
factors might be matters of learning in 
some loose sense of the word, but they 
are not easily handled by any modern 
conceptions of the nature of learning. 
The development is more likely, it seems 
to us, to be explained adequately by the 
stimulation-history of the organism; 
that is, by gradual changes in the physi- 
cal properties of the receptive mechanism 
according to differential stimulation pat- 
terns and frequencies. Whether or not 
the satiation-theory based on hypotheses 
about electrotonus (6) can account for 
the different phenomena that are in- 
volved is another question. The great- 
est difficulty at present seems to result 
from the ‘psychological’ nature of the 
relevant factors. Nevertheless the 
special gua/e of ‘verticalness’ which per- 
sists even when one’s head is tilted must 
itself have some representation in the 
nature of brain processes, and it is con- 
ceivable that we must extend our con- 
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ception of edtiation to include the neural 
representatives of visual aspects that are 
not contained directly in the geometry of 
the stimulus. 


SuMMARY 


1. Having questioned the generality 
of Kohler and Wallach’s explanation 
of Gibson’s tilted line phenomena, 
we performed a series of experiments 
to test whether the straightening or 
normalizing of slightly tilted lines 
near the vertical and horizontal might 
occur independently of conditions 
favorable to the figural after-effect. 

2. We have been able to show that 
lines tilted 10 degrees away from the 
vertical or horizontal show a normal- 
ization of about two degrees under 
our conditions. 

3. We have shown that this effect 
is independent of the simultaneous or 
previous presence of contours that 
might produce the typical figural 
after-effect. 

4. We have shown that this phe- 
nomenon is separate from the dis- 
orientation phenomena of Wertheimer 
and Asch and Witkin inasmuch as it 
is localized in the visual field and de- 
velops only after fairly long periods of 
fixation. 

5. We have shown that it cannot be 
accounted for in terms of inhomogene- 
ities of the primary visual projection 
areas, either anatomical or physiologi- 
cal, since the effect appears in full or 
nearly full strength when the head is 
tilted to parallel the lines which then 
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fall on the vertical meridian of the 
retina. 


6. A brief theoretical discussion 


has suggested the necessity for extend- 
ing the theory of figural after-effects 
to account for these findings. 


(Manuscript received May 31, 1949) 
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INFLUENCE OF THE PRECEDING ITEM IN 

MEASUREMENTS OF THE NOISE-MASKED 

THRESHOLD BY A MODIFIED CONSTANT 
METHOD 


BY TILLMAN H. SCHAFER 
Psychology Division, Navy Electronics Laboratory, San Diego, California 


INTRODUCTION 


In tests to determine the auditory 
threshold for tonal signals in the pres- 
ence of various masking sounds, it was 
noticed that the Os often did not de- 
tect a weak but ordinarily audible 
signal following a strong signal or a 
burst of background noise. It was 
also suspected that after a long series 
of signals too weak to be detected the 
O might fail to report the first audible 
signal because of habituation or lack 
of confidence. Therefore, it was de- 


cided to measure the effect of the pre- 
ceding item over a wide range of 


intensity, neglecting possible effects 
of still earlier items in the series. 


Metuop or PrRocepURE 


Five Os were seated in separate booths pro- 
vided with headphones and a voting key. Be- 
fore each test, the masking noise was turned on 
and the Os were given a chance to become ac- 
customed to it. They were then instructed to 
vote whenever and as long as they heard a signal 
they had not heard before. E£ then injected the 
signa! at various levels, and when he was satisfied 
that they were voting on the correct signal, he 
told the Os that the test was about to begin and 
that they should continue voting as instructed. 
During the test the masking noise was on con- 
tinuously, and three-sec. signals of different 
levels were presented in succession at intervals of 
five sec. One-third of these were blanks. The 
observers were aware that there were blank pres- 
entation periods but did not know when they 
were to occur. They also knew that an error of 
commission would be counted against them with 
twice the value of a correct vote (due to the rela- 
tive number of periods with and without signals). 
Commission errors were subsequently found to 
occur at an average rate of less than one per O 
per test, indicating that guessing was success- 
fully controlled by this method. At the same 


time, competition between Os helped to keep 
them doing their best. All of the Os had had 
considerabie experience with the same kind of 
tests. 

A series of tests was chosen in which the back- 
ground was thermal noise, uniform in spectrum 
from 400 to 1600 cps and of negligible intensity 
outside this band (6). In half of the tests, the 
noise produced an average of 55 db of masking; 
that is, it raised the average O's threshold 55 
db at the frequency of the signal. In the other 
half of the tests, the noise produced an average 
of 15 db of masking. 

The signal was either a pure tone or a pair of 
pure tones separated by 25, 100, or 500 cps and 
having a geometric mean frequency of 800 cps. 
Three 144-item tests were run at each masking 
level with each of several different signals: 
(1) either 787 or 812 cps; (2) either 751 or 851; 
(3) 603; (4) 1103; (5) 787 with 812; (6) 751 
with 851; (7) 603 with 1103. 

The signal was presented at eight levels, 
covering a range of 14 db in two-db steps, ap- 
proximately centered by E during the practice 
session at the masked threshold of the five Os. 
That is, the range of variation of the signal was 
so selected that the strongest signals were almost 
always heard and the weakest signals were al- 
most never heard. The masked threshold as 
defined in this study is the intermediate level at 
which 50 percent of all signals presented are 
detected. 

A test was made up of 12 presentations of the 
signal at each of the eight levels, 96 in all, plus 
48 blank periods. These numbers were dictated 
by £’s decision that there should be about half 
as many blanks as presentations, and, more im- 
portant, that each level of the signal should be 
preceded once and only once by itself and every 
other level. The latter requirement was im- 
posed so that the possible influence of the pre- 
ceding signal could be studied and so that the 
thresholds determined in the experiment would 
be as free as possible of this influence, as sug- 
gested by Fernberger (1). 

The desired sequence was constructed as 
follows. A ‘key-form’ was chosen at random 
from hundreds listed in Otterstrom (4). A 
key-form is a sequence of the first 11 positive 
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integers, from which a sequence of the 12 musi- 
cal keynotes can be derived in which no interval 
between successive keynotes is repeated. That 
is, each interval, such as a perfect fifth, a minor 
third, etc., only occurs once in the sequence, but 
all 11 possible intervals occur. Such a sequence 
is called a ‘work-form.’ The numbers in the 
work-form are for the purposes of this study 
used to represent the eight signal levels and four 
distinct blank presentations. 

The work-form was transformed into a similar 
sequence by adding one to each element. The 
law of addition used was the same as addition of 
hours on a clock face, except that 12 was denoted 
by o. For example, 10 + 1 = 11, 10 +20, 
10+ %34= 1. This process was repeated until 
there were 12 such sequences, within which all 
of the desired successions of two levels occurred, 
except those which could only be obtained by 
interconnecting the sequences. ‘These were then 
interconnected by trial and error, so that nearly 
all the successions were obtained. Only two 
successions were omitted, one between two 
blanks and one between two very strong items. 
One of these was the result of a duplication, the 
other arose from the fact that only 143 succes- 
sions are possible in a sequence of 144 numbers. 
Finally, the four numbers representing blanks 
were replaced by a single symbol, and the se- 
quence was complete. 

The sequence finally used is shown in Table I. 
The numbers in Table I represent db of attenu- 
ation from the condition of maximum signal. 
For example, © represents maximum signal, 4 


TABLE I 


Sequence or PRESENTATIONS 
(Attenuation from maximum gain, in db) 


6 14 


8 wo 


402 12 © @ 
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represents a signal 4 db below maximum, and 
# represents a blank presentation. The order 
was used about half the time forward and half 
the time backward. As a 144-item test was ex- 
cessively tiring, it was given in halves. This 
caused one more succession to be omitted, that 
between the last item of the first half and the 
first item of the last half. However, it was only 
the succession of one blank by another. 

The Os’ votes were recorded electrically on 
a slowly advancing chemically treated roll of 
paper. At the same time, the number of deci- 
bels of attenuation from maximum gain was 
recorded. In this way, it was possible to corre- 
late the Os’ votes very precisely with the signal 
level. After the test, the votes were tallied on 
score sheets in such a way that they could readily 
be broken down by signal level, preceding signal 
level, observer, kind of signal, or level of the 
masking noise. 


TREATMENT OF THE DaTA 


The unit of data is an indication by an O that 
he heard the signal at a certain level when the 
preceding signal was at a certain level or the 
absence of this indication. At a masking level 
of 55 db, 3 complete replications of a 144-item 
test on each of 5 Os and 7 signals were obtained. 
At a masking level of 15 db almost the same 
amount of data was obtained, but one O's tests 
were not quite complete and had to be dis- 
carded. Thus, the total number of data ob- 
tained was (3 X 144 X 5 X 7) + (3 X 144 X4 
X 7) =27,216. 

In the three tests for each signal and masking 
level, there were three presentations at each level 
following each preceding level but 12 presenta- 
tions at each level following blank presentation 
periods. There were also slight irregularities 
in the number of data caused by the imperfec- 
tions in the presentation sequence. In order to 
give the data for every preceding item level 
equal weight, the unusual data were reduced to 
the basis of three as a perfect score. 

The tables of number detected (out of three 
possible) as a function of level might be plotted 
immediately to give the psychometric function 
for a tonal signal masked by random noise, which 
in general varies with observer, signal, masking 
level and preceding item level. For the present 
purpose, we desire to eliminate the variation in 
the psychometric function due to all these vari- 
ables except preceding item level, and we desire 
to make the most precise estimate of its variation 
with preceding item level. It is sufficient for 
this purpose to discuss the variation of one point 
on the psychometric function. The point chosen 
is the ‘threshold,’ the level at which 50 percent 
of the signals presented are detected. 
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TABLE Il 
Mean Numper or Sicnats Derecren (Out or Turet) at L= 1 











db of 


Preceding Item Level (P db above Threshold) 





Masking Frequency (cps) 





787 or 812 
787 and 812 
751 or 851 
751 and 851 
603 
1103 
603 and 1103 








787 or 812 
787 and 812 
751 or 851 
751 and 851 
603 
1103 
603 and 1103 



































Suppose that an observer, a signal and a 
masking level have been chosen. At each level 
of signal presentation, we find the mean number 
detected; that is, we deliberately average out the 
effect of the preceding item. We then draw the 
psychometric function and read off the threshold. 
Henceforth, levels of the signal in tests involving 
the chosen observer, signal, and masking level 
are expressed in db above the threshold. When 
all the data have been referred to similar thresh- 
olds, they may be averaged in any way desired, 
without regard to any variable except signal 
level L and preceding signal level P. Here L and 
P are levels in db with respect to the physically 
variable but psychologically constant threshold 
determined, as described above, by averaging 
the data over all preceding item levels for each 
combination of observer, signal and masking 
level. 

For presentation and analysis, because of the 
large number of data available, it is necessary to 
average corresponding data for the five or four 
Os (the number depending on the masking 
level). This reduces the variability of the data, 
but it also reduces the number of degress of free- 
dom (independent comparisons). In spite of 
the fact that the results may not be quite the 
same, numerically, it seems certain that no great 
error is committed in discussing the average 
number detected by all Os instead of treating all 
the data separately, especially since the differ- 
ence in Os’ thresholds has already been allowed 
for. 

Average data' for L =: 1 are presented in 


1For detailed data at other levels order 
Document 2678 from American Documen- 


Table II as a function of the remaining variables. 
The data for high masking level occupy the top 
half of each section; those for low masking level, 
the bottom half. The signal frequencies are 
shown at the left. The data on average number 
detected out of three chances are arranged in 
columns for which the level of the preceding item, 
P, is constant. This is a sample of the basic 
data whose variation will be analyzed in what 
follows. 

Table III gives the mean number detected, 
for all signals, as a function of L and P. Along 
any row, L is constant and P varies from 9 db 
above threshold to —«. Inspection of Table 
III shows that variation with P is strongest near 
threshold (L = +1) and rapidly decreases with 
increasing or decreasing L. For example, at 
L, = 1, the number detected increases from 1.43 
at P = 9 to 2.25 at P = 3, while at L = 7 there 
is no variation at all with P, as all items are 
detected. 

The variation of the preceding item effect 
with L makes it necessary to test the significance 
of the observed effect at each L. To do this, 
it will be necessary to introduce a little notation. 

Denote a basic datum by Yp;. The subscript 
P identifies the value of P pertaining to the 
datum, and takes on values ranging from P, to 
P,, where k = 9. This merely means that P can 
have nine different values in these tests. The 
subscript i identifies the datum within the 


tation Institute, 1719 N St., N.W., Washington 
6, D.C., remitting $0.50 for microfilm (images one 
in. high on standard 35 mm. motion picture 
film) or $0.80 for photocopies (6 X 8 in.) read- 
able without optical aid. 
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Mean Numeer or Stcnats Derecrep (Our or Taree) ror Att Tests 














P (db above Threshold) 
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3-00 
2.92 
2.58 
1.71 
0.88 
2.39 
0.24 



































sample, and runs from 1 to Np. Table II shows 
that Np is 12 at P = 9, 10 at P — 5, and 14 at 
all other levels. The total number of data, N, 
which is the sum of the values of Np, is thus 
120, 

Denote the mean of the data at one value of 
P by Yp and the mean of all the data by YF. 
Let us call the former a sample mean; the latter, 
the general mean. ‘Then the total sum of squares 
of deviations of the data from F can be divided 
into two parts, the sum of squares of deviations 
from the sample means Pp and the weighted 
sum of squares of deviations of the sample means 
from the general mean F (s). 

The fundamental identity is 


Pe Np 
zy =U(Yr,- PY 
P=P, imi 
Pi Np 
= = 2 (¥p,— PrP 


P=P, i=! 


Pt 
+ © Ne(¥rp—PY (1) 
PP, 


Each part of Equation (1) provides an esti- 
mate of the variance of the set of all data that 
could be taken at level L. The estimates are 
obtained by dividing each part of the sum of 
squares of deviations by the greatest number of 
independent comparisons (degrees of freedom) 
that can be made among the data used to com- 
pute that sum of squares. The corresponding 
estimates of variance are shown in the first line 
of Table IV. 

To show that there is a real preceding item 
effect at some level L, it is sufficient to show that 
the estimate of the variance made from the third 
column of Table IV is significantly greater than 
the estimate based on the second column. In 
other words, it is sufficient to show that the 
mean percentage detected varies more with pre- 
ceding item level than would be expected from 


the observed scatter of the data about the differ- 
ent sample means. 

The estimates of variance computed for the 
data are shown in Table IV, together with their 
ratio, F (7). The probability of getting a 
greater F by chance, assuming that all the data 
come from the same normal universe, is shown 
in the right-hand column. 


REsULTsS 


Table III has shown that practi- 
cally all items are detected at L = 7 
or 5. Therefore, these are not in- 
cluded in Table IV. Table IV shows 
that the observed variation at L = 3 
or --5 could easily be due to chance, 
but that the observed variation at 
L = 1, —1, or —3 can hardly be ex- 
plained in this way. In view of the 
level of significance shown and the 
similarity in the form of the variation 
at these levels, we feel quite safe in 
accepting the observed variation with 
preceding item as a real effect. 


TABLE IV 


Estimates oF VARIANCE oF MEAN 
Numper Detectrep 








Based on 
Variation 
Within 
Samples 


Based on 
Variation of 
Sample 
Means 





OI! 
0.19 
0.23 
0.16 
0.15 


O11 
0.90 
0.74 
0.41 
0.18 
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LEVEL OF PRECEDING ITEM RELATIVE TO THRESHOLD (P) 


Fr. 1. 
ing item level L constant. 
Marks at right indicate the mean for each L. 


Its characteristic features are shown 
in Fig. 1, which presents a family of 
graphs of the mean percentage detected 
as a function of P, with L held con- 
stant at a different level on each 
graph. The principal features noted, 
following the graphs for L = 1 and 


Mean percent detected as a function of preceding item level P relative to threshold, hold- 
Vertical bars indicate plus or minus one standard error of the mean. 


—1 from left to right, are a slight dip 
below the general mean at P = L, a 
rise to a peak well above average at 
P = 3, and a sharp and continuous 
decrease beyond P = 3. The per- 
centage of items detected at 1 db 
above threshold falls from 75 percent 
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Fic. 2. 


Mean percent detected as a function of item level L relative 


to threshold, holding preceding level P constant 
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TABLE V 


Rance or Variation or Mean 
Percentace Derecrep 
Level, L (db) Range (%) 


7 
5 


3 
I 


-1 
9 
ei 


when the preceding item is 3 db above 
threshold to 48 percent when the pre- 
ceding item is 9 db above threshold. 
Similarly, at 1 db below threshold the 
percentage of items detected falls from 
50 percent when the preceding item 
is 3 db above threshold to 27 percent 
when the preceding item is 9 db above 
threshold. The dependence becomes 
less for items farther above or below 
threshold. For each level L, the range 
of variation, that is, the difference be- 
tween the greatest and the least mean 
percentages detected, is tabulated in 
Table V. 

The preceding item effect causes 
marked distortion of the psychomet- 
ric function. Fig. 2 shcws the psy- 
chometric functions for signals follow- 
ing signals of level P = 9, 3 and ~. 
The graph for P = 3 shows a small 
improvement in detection with respect 
to the graph for P = —«. That for 
P = g shows the opposite. Similar 
curves for other values of P could be 
plotted from the columns of Table III; 
these would be intermediate between 
the extremes shown. 


Discussion 


It does not seem likely that the 
present observations could be ac- 
counted for by purely physiological 
factors. Rather, true psychological 
factors appear to be involved which 
are concerned with set and with 


doubtful judgments. For example, 
it seems that a loud tone leads O to 
expect another loud tone and thereby 
makes him less certain that he heard 
a weak tone following. On the other 
hand, when a weak tone is heard, it 
appears to establish a set which causes 
the following weak tone to be re- 
garded as a confirmation of an orig- 
inally doubtful judgment that was 
suspended until a confirming judg- 
ment could be made. 

Effects similar to these have been 
discussed in the literature in terms of 
time error, contrast, central tendency, 
and anchoring. Helson (2) consid- 
ered these effects from a unified point 
of view using the concept of adapta- 
tion level as a frame of reference. The 
most closely related work seems to be 
that of Long (3), who showed that 
when a series of tones of variable 
intensity are separated by a standard 
near the top of the series, the weaker 
tones of the series are underestimated. 
This is in harmony with the main 
effect demonstrated in these tests, the 
elevation of the threshold when a 
strong signal precedes. However, 
there does not seem to be anything in 
Long’s results similar to the slight 
lowering of threshold observed with 
moderate preceding signals. But the 
experiments were quite different and 
the effect is not great. Perhaps it 
would be well not to insist too strongly 
on the effect until further data are 
available, though for the sample in 
hand it is statistically significant. 


SumMaRY OF REsULTs 


1. The entire psychometric function 
for three-sec. tonal signals presented 
at five-sec. intervals in a background 


of thermal noise varies with the 
intensity of the preceding signal. 
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2. Within one db of threshold the 
number of equally intense signals 
detected varies by approximately 25 
percent as a function of the preceding 
signal. A relatively intense preced- 
ing signal reduces the percentage 
detected, a moderately intense pre- 
ceding signal increases it, and a pre- 
ceding signal at the same intensity 
slightly reduces it. 

3. Following a suggestion of Fern- 
berger’s for mitigating the influence of 
the preceding item, a sequence of a 
chosen set of signal intensities is given 
in which each precedes all of the others 


and itself very nearly the same num- 
ber of times. 


(Manuscript received June 6, 1949) 
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PERCEPTUAL ORGANIZATION OF MATERIALS AS A 
FACTOR INFLUENCING EASE OF LEARNING 
AND DEGREE OF RETENTION! 

BY EZRA V. SAUL AND CHARLES E. OSGOOD* 

University of Connecticut 


INTRODUCTION 


The fundamental precept of the 
gestalt approach to learning and mem- 
ory is that perceptual processes persist 
in and determine mnemonic processes. 
This is accomplished through transla- 
tion of mnemonic processes into ‘dy- 
namic traces existing within a brain 
field’ where gestalt principles of per- 
ception are directly applicable. The 
following quotation from Koffka (2) 
gives the essence of this view: “If any 
perceived form is a product of organiz- 
ation... [it] is sustained by real 
forces. . . . [In] cases of very irregu- 
lar patterns the internal forces of 


organization will . . . be in conflict 
with the external ones; the perceived 


forms will be under stress. ‘There- 
fore, if the trace retains the dynamic 
pattern of the original excitation, it 
will be under stress too, and the 
changes which occur within it will be 
such as to reduce those internal 
stresses” (p. 496). Several experi- 
mentally testable hypotheses evolve 
from this basic principle, only two of 
which will be considered here. (1) 
The better the perceptual organization 
or structure of the materials to be 
learned and retained, the more stable 
will be the traces laid down and hence 
the greater will be the ease of learning 
and degree of retention. (2) Since 
the stresses present in original per- 
ception persist in the trace, mnemonic 


1 A preliminary report of this research was de- 
livered at the Eastern Psychological Association 
Meeting in Springfield, April 8, 1949 

*The second author is now located at the 
University of Llinois. 
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changes should be predictable in*di- 
rection and progressive in time. The 
first of these statements is the topic 
of the present paper; the second will 
be considered in a later communica- 
tion. 


The classic studies on organization of 
materials for learning were contributed 
by Von Restorff (5) under Kéhler’s aegis. 
She found that when relatively isolated 
materials of one type were imbedded in 
lists of other massed materials, recall of 
the isolated items was clearly superior. 
A companion experiment suggested that 
the difference between isolated and 
massed items was attributable to the 
tendency for the similar massed items 
to ‘agglutinize’ or assimilate with one 
another, thus destroying their distinctive 
characters. Pillsbury and Rausch (3) 
varied the proportions of numbers (N) 
and nonsense syllables (S) in eight-item 
lists, read once and immediately recalled, 
from 1N-7S to 7N-15S; recall was found 
to be a direct function of the relative 
degrees of isolation. Katona (1) reports 
a number of experiments in which learn- 
ing by rote methods is contrasted with 
learning of the same materials by ‘under- 
standing’ methods, retention being de- 
finitely superior in the latter case—while 
the ‘structural traces’ set up by meaning- 
ful, organized methods of learning are 
more stable and persist longer, according 
to Katona, chaotic ‘individual traces’ 
are set up by rote memorizing and vanish 
quickly unless continuously reinforced. 
Koffka (2, p. 523) also refers to ‘the low 
survival value of chaotic units.’ 

A major difficulty with both the Von 
Restorff and Pillsbury and Rausch ex- 
periments was that the absolute numbers 
of isolated and massed items varied along 
with degrees of isolation per se. This 
made it possible to interpret the results 
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in terms of intraserial interferences with- 
out any reference to memory traces or 
their stability. A recent study by Siegel 
(4) shows more adequate use of experi- 
mental controls. He designed lists of 
two types of material fitting the per- 
ceptual analogy shown in Table I (Col- 
umn 6), in which each type of item ap- 
pears in varying degrees of isolation yet 
the total number of items of each type is 
equal. His results also substantiated 
the gestalt hypothesis. 


But there is another serious the- 
oretical and methodological difficulty 
involved. The gestalt hypothesis 
states that perceptual organizing 
forces affect ease of learning (with 
which few psychologists would dis- 
agree, whether one prefers to specify 
gestalt dynamisms or behavioristic 
mechanisms of generalization and dis- 
crimination) and persist in and affect 
retention. In none of the experiments 
cited is it possible to separate the 
effect of perceptual organization on 
original learning from its effect on 
retention. Do well organized mem- 
ory traces resist interference and 
facilitate recall? There appear to be 
no experiments to date designed in 
such a manner that this question can 
be answered. Siegel, for example, 
presented his items once and only 
measured immediate recall—were the 
differences on this single recall due to 
ease of original learning or mnemonic 
processes? The purpose of the pres- 
ent study is to test the gestalt hy- 
pothesis as it relates to retention, by 
providing for an immediate recall (as 
a measure of original learning) and 
two delayed recalls (as measures of 
retention). 


PRoceDURE 


An adequately designed experiment to test 
the gestalt hypothesis that perceptual organiza- 
tion of materials influences original learning and 
recall should meet at least the following criteria: 


(1) Isolation or homogencity of materials 
should be varied through a number,of objectively 
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specifiable degrees. (2) The absolute number of 
items for each of the two types of material em- 
ployed (here numbers and nonsense syllables) 
should be equal. (3) With different groups of 
subjects the two types of material should be 
alternated in order to eliminate any special ad- 
vantages for learning or recalling numbers of 
syllables. (4) An objective method of recording 
and interpreting subjects’ responses should be 
provided. (5) There should be provision for 
independent estimation of original learning and 
retention. Since Siegel’s procedures and ma- 
terials seemed to meet all these criteria except 
the last, we have duplicated them as faithfully 
as possible, adding two delayed recalls (after 1 
and 24 hours) to satisfy the fifth criterion. 

Ten nonsense syllables and ten numbers were 
ordered so as to fit the perceptual analogy shown 
in Table I (Column 6). T'wo lists were con- 
structed, the second reversing al! syllable-num- 
ber positions (i.c., when a syllable is placed in 
Series I, a number is placed in Series 11). The 
two lists are given as Columns 8 and 9, serial 
position being indicated in Column 5. Whereas 
Siegel limited his analysis to items No. 3 and 
No. 13 (as being relatively isolated) and items 
No. 6 and No. 16 (as being relatively homogene- 
ous), the present investigators wished to extend 
the analysis to measurable degrees of homogene- 
ity. ‘Therefore, the homogeneity ratings shown 
in Column 7 were computed in the following 
manner: the three items both preceding and suc- 
ceeding were weighted according to their distance 
from a given item, a similar-type item directly 
adjacent being rated 3, an item two steps away 
being rated 2, and three steps away 1; using item 
No. 10 as an example, since items No. 9 and No. 
7 are of different type, they do not count, nor do 
items No. 11 and No. 12, but item No. 8 con- 
tributes 2 degrees of homogencity and item No. 
13 contributes 1 degree—the homogencity rating 
for item No. 10 becomes 3. Positions beyond 
the ends of the series were counted as dissimilar 
items. 

The subjects, 160 undergraduates taking 
psychology at the University of Connecticut, 
were read the following instructions: “On the 
screen before you there will be exposed a series 
of nonsense syllables and numbers in random 
order. Each item will be exposed for a short 
interval of time, and I want you to try to re- 
member as many items as you can. The order 
makes no difference. I am interested only in the 
total number of items that you can recall cor- 
rectly. Please attend carefully and avoid mak- 
ing any comment.” The lists of items were then 
presented through the use of slides on an East- 
man overhead slide projector. This detail 
represents a slight deviation from Siegel’s pro- 
cedure—he printed the items on cards and held 
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them up before a class of subjects, and this may 
be the reason our recall scores are consistently 
higher than his. Each slide contained only one 
item and was exposed for five sec. The rhythm 
of presentation and exposure was carefully re- 
hearsed. The presentation times were the same 
in both studies. 

Following presentation of the last item the S’s 
were instructed as follows: “On the slips of paper 
being passed out please record your name in the 
upper right-hand corner. Then record legibly as 
many of the items as you can recall.” Three to 
five min. was the usual time required for the Ss to 
complete the recording of items. At this point, 
and with little opportunity for discussion, a lec- 
ture was begun which required their full atten- 
tion. One hour after the initial recall test, E 
re-entered the classroom and instructed the Ss 
as follows: “A short time ago you were asked to 
take part in a memory experiment. As a follow- 
up of that experiment you are now asked to recall 
correctly as many of the items from the experi- 
mental list as you can. Record your recollec- 
tions legibly with your name on the paper pro- 


vided.” Twenty-three hours later, at the recon- 
vening of the class, E again appeared and 
repeated the above instructions. It should be 
noted that there was no reason for Ss to antici- 
pate either of the delayed recalls. A rigorous 
scoring criterion, also used by Siegel, was ob- 
served, only absolutely correct items, with no 
letter substitutions, being counted. 


REsULTs 


1. Original Learning.—The basic 
data from which all values given in 
this report were computed are given 


in Table I. Columns 1 and 3, for 
Siegel’s study and the present one, 
compare the absolute numbers of Ss 
recording the items in each serial posi- 
tion on the immediate recall (measure 
of learning). Columns 2 and 4 trans- 
form these recall scores into percent- 
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ages of the total number of Ss, these 
values being more comparable since 
Siegel used 70 Ss on each list and we 
used 80. Fig. 1 provides a graphic 
record of these percentage values. 
Both experiments yield fairly typical 
serial learning curves, items at the 
ends of the list being relatively easier 
to learn. But while Siegel obtained 
clear evidence for the superior learn- 
ing (and/or retention) of the relatively 
isolated items (No. 3 and No. 13) as 
contrasted with the relatively im- 
bedded items (No. 6 and No. 16), only 
item No. 3 stands out in our data and 
this might merely reflect the fact that 
item No. 3 was always the first item of 
a new type (i.e., either the first sylla- 
ble or the first number to appear in 
the list). 

Are these results possibly due to the 
arbitrary selection of items 3, 6, 13, 
and 16? Fig. 2 plots percent correct 
recalls as a function of degrees of 


homogeneity, the light lines represent- 
ing average values for each degree of 
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homogeneity and the heavy lines 
representing combined weighted aver- 
ages for isolated (degrees 0, 1, and 2), 
intermediate (degrees 3, 4, and 5) and 
homogeneous (degrees 6, 7, and 8) 
items. The numbers in parentheses 
below the homogeneity ratings indi- 
cate how many items contributed to 
each degree. While the combined 
weighted averages for Siegel’s data, 
(dashed heavy line,) display the pre- 
dicted trend—immediate recall values 
decreasing with increasing homogene- 
ity—the relation is less apparent in 
the present data (solid heavy line). 
In fact, if item No. 3 is eliminated, the 
trend disappears entirely, the most 
isolated items now having an average 
recall value of only 30.9. 

2. Retention.—-The data in the pres- 
ent experiment on the effect of per- 
ceptual organization upon retention 
cannot be compared with Siegel’s 
earlier study, since he failed to use any 
delayed recall condition and hence 
could not measure it. Columns 10 











Fic. 1. Original learning as a function of serial position—data 
from Siegel (4) and the present experiment 
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and 11 in Table I give the average 
numbers of Ss recalling each item on 
the one-hour and 24-hour tests re- 
spectively. As would be expected, 
delayed recall scores are lower than 
immediate recall scores in every case 
but one, i.e., forgetting occurs. Quite 
unexpected, however, is the fact that 
scores for the 24-hour recall are not 
consistently lower than those for the 
one-hour recall; for 7 of the 20 serial 
positions retention is better after 24 
hours than it had been after only one 
hour. This facilitative phenomenon, 
resembling the Ballard-Williams type 
of reminiscence, bears no consistent 
relation to the serial order of the items 
nor to degrees of homogeneity—in 
fact, the most isolated items show this 
type of facilitation the least, which is 
quite contrary to the gestalt hypothe- 
sis. One relation of possible signifi- 
cance did appear, however: when the 
items displaying this reminiscent effect 
are analyzed in terms of the type of 
material, there are four times as many 
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reminiscent nonsense syllables as there 
are reminiscent numbers. Why this 
should be is not apparent—it is possi- 
ble, of course, that three-digit num- 
bers suffered more retroactive inter- 
ference during the 24-hour interval, 
but as an explanation this reasoning 
is entirely circular. 

To test the retention aspect of the 
gestalt hypothesis it is necessary to 
equate degrees of original learning 
and then determine if differential 
rates of forgetting are shown by iso- 
lated and homogeneous items. Using 
the level of immediate recall for each 
serial position (measure of original 
learning) as a base-line, deviations 
from this level at one- and 24-hour 
delayed recalls were computed as per- 
centages. These percent losses are 
given in columns 12 and 13 in Table I, 
and a very slight tendency for forget- 
ting to become greater toward the end 
of the serial list is shown. With re- 
gard to the critical items, while iso- 
lated items No. 3 and No. 13 do show 
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comparatively small losses, other less 
isolated items (viz. No. 1 and No. 8) 
have equally small or smaller losses, 
and one of the critical imbedded items, 
No. 6, shows less forgetting than either 
of the critical isolated ones! Fig. 3 
plots percent losses for one- and 24- 
hour recalls as functions of degree of 
homogeneity, light lines again being 
used for individual degrees and heavy 
lines for combined averages (0, 1, 2 
degrees, 3, 4, 5 degrees and 6, 7, 8 
degrees). Resuits for the one-hour 
recall (R;-R;:) are shown in dashed 
lines and those for the 24-hour recall 
(R;-R;) in solid lines. Imbeddedness 
of items is not found to have any con- 
sistent effect upon their retention. 


Discussion 


The gestalt conception of memory, 
as it has been developed by theorists 
like Koffka (2) and Katona (1), places 
a heavy emphasis on the perceptual 
organization of materials. Indeed, 
the core of their position is that forces 


present in original perception persist 
in the memory trace and continue to 
work toward their own resolution. 
One concrete hypothesis deriving from 
this view is that the memory traces of 
isolated items in a list of materials, 
standing out as figures on a ground, 
will be more stable than the memory 
traces of homogeneous items, which 
suffer the fate of assimilation with 
others of their kind. Not only should 
this lead to easier learning of isolated 
items in such a list, but also they 
should be better retained—in fact, 
since perceptual stresses in the pat- 
tern of memory traces are assumed to 
continue working through time, the 
differences between the retention of 
isolated and homogeneous items 
should increase with the length of the 
retention interval. 

A number of experiments, chiefly 
those of Van Restorff (5), Pillsbury 
and Raush (3) and Siegel (4), have 
demonstrated the superior learning 
and/or retention of the relatively 
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isolated items in lists of material. 
Since the designs of all these studies 
made it impossible to segregate the 
effect of perceptual organization upon 
memory from its effect upon learning, 
it is quite possible that they merely 
demonstrate facilitative effects of the 
organization of materials upon ease of 
learning. ‘This is actually not a point 
of issue between gestalt and behavior- 
istic theories. Techniques employed 
to improve the ‘structure’ or ‘organi- 
zation’ of materials are equally inter- 
pretable as techniques for minimizing 
intralist or interlist generalization, 
and hence of improving initial dis- 
criminability. Siegel (4), for ex- 
ample, in the discussion of his results 
states that “Gibson’s analysis of mem- 
ory employing S-R constructs, very 
nicely generates Restorff’s original 
results. It is conceivable that with 


some slight extension, the results here 
might be similarly deduced” (p. 315). 


As the gestalt hypothesis relates to 
memory per se, however, there are 
points at issue between it and the be- 
havioristic interference theory. The 
main issue is this: the gestalt theory 
must predict that to the extent that 
there are structural stresses set up 
during original perception, these 
stresses will persist in the trace field 
and produce progressive autonomous 
changes; the interference theory, by 
its very nature, holds that there are no 
autonomous changes in memory, all 
modifications being attributable to 
some form of interpolated activity 
(i.e., retroactive interference). In the 
present connection, the gestalt theory 
must predict that the perceptual 
organizing forces set up in a list of 
items by varying their degrees of 
isolation will persist during the re- 
tention period and produce predictable 
differences in recall. The interfer- 
ence theory would predict that if 
original learning is equated—and 
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amounts of interpolated activity in- 
volving isolated and homogeneous 
items can be assumed to be equivalent 
—then no difference in retention will 
be observed. 

The results of the present experi- 
ment clearly do not favor the gestalt 
position. When degrees of original 
learning are equated by using the 
level of immediate recall as a base-line 
from which to estimate mnemonic 
changes, no evidence is obtained for 
better retention of isolated items. It 
could, of course, be argued that no 
differences in retention would be ex- 
pected here, since we did not, in con- 
trast to Siegel, obtain very clear evi- 
dence for better original learning of 
the isolated items. Keeping in mind 
the fact that Siegel’s experiment is 
apparently the only one (except the 
present study) where degrees of isola- 
tion have been varied without simul- 
taneously varying the absolute num- 
ber of items of different types, this 
becomes a serious question. If our 
results for original learning are valid 
and Siegel’s not, then it would appear 
that isolation and massing are effec- 
tive only in terms of the number of 
similar items and resultant possibili- 
ties for intraserial interferences. If 
Siegel’s results are valid and ours not, 
then it is possible that we do not have 
a fair test of the gestalt hypothesis as 
it applies to memory—although one 
would expect the dynamic stresses to 
appear even more strongly after a 
24-hour interval. Since we purposely 
duplicated Siegel’s materials and pro- 
cedures in every way (except for pre- 
senting items on a screen rather than 
on manually held cards, which seems 
a minor difference, and adding on de- 
layed recall tests, which could have no 
effect upon the immediate recall), the 
explanation of this discrepancy on 
immediate recall must lie in some 
subtle variable—perhaps, like rats at 
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Yale and the University of California, 
students at Davidson College and the 
University of Connecticut behave in 
accordance with the theoretical pre- 
dilections of their experimenters. 


SUMMARY 


Central to the gestalt theory as it 
applies to learning and memory is the 
notion that dynamic stresses present 
in original perception persist in sub- 
sequent memory traces. This paper 
reports a test of one of the concrete 
predictions arising from this notion, 
namely, that the traces of isolated 
items in a serial list composed of two 
types of material will be better retained 
than traces of massed items. Dupli- 
cating the materials and procedures 
previously employed by Siegel (4), 
who, like other investigators on this 
problem before him, had failed to dis- 
tinguish between original learning and 
retention per se, the present investiga- 
tion used immediate recall as a meas- 
ure of original learning and two de- 
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layed recalls, after one and after 24 
hours, as measures of retention. In 
comparison with Siegel’s data, the re- 
sults of this experiment do not show 
as clear a relation between degrees of 
isolation and ease of learning. With 
regard to the effects of perceptual or- 
ganization upon memory per se, there 
was no evidence for better retention 
of isolated items. These results are 
considered in relation to theoretical 
issues. 


(Manuscript received June 2, 1949) 
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THE ACTION OF VARIOUS AFTER-EFFECTS 
ON RESPONSE REPETITION * 


BY CARL P. DUNCAN 
Northwestern University 


I. INTRODUCTION 


The problem of the action of punish- 
ment in learning has recently been 
pointed up sharply by the apparently 
differing interpretations of Thorn- 
dike’s views by Postman (4,5) and 
G. R. Stone (7,8). A related prob- 
lem, particularly in serial verbal learn- 
ing, is the phenomenon known as the 
Thorndike effect, or spread of effect. 
This is defined as an increase in repeti- 
tion of punished responses the closer 
those responses are to a rewarded 
response. The controversial litera- 
ture on these topics and on other mat- 
ters related to the law of effect has 
been reviewed by Postman (4) and 


need not be repeated here. 
Concerning punishment, Stone (7) 


has differentiated several possible 
ways in which it might act. One of 
these possibilities is that punishment 
may fixate behavior directly but to a 
lesser extent than reward. Accord- 
ing to this interpretation, punished 
behavior is eliminated not by active 
weakening but by losing out in the 
competition with rewarded responses. 
In an experimental paper Stone (9) has 
presented evidence which seems to 
corroborate his view that punishment 
strengthens verbal connections. Say- 
ing “wrong” produced repetitions of 
response with greater frequency than 
would be predicted by chance. More 
importantly, the punished responses 
were repeated with a frequency above 
an empirical baseline, obtained with a 
control group of Ss in which E did not 
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punish; i.e., said nothing after S’s 
response. 

Considerably more evidence is 
needed, however, before one can ac- 
cept the generalization that punish- 
ment strengthens verbal stimulus- 
response connections. For one thing, 
such a conclusion appears to contra- 
dict another interpretation of the 
action of punishment; viz, that it 
induces variability of response, 
thereby increasing the probability 
that the correct response will occur. 

Whatever the action of punishment, 
in serial verbal learning one must also 
consider the possible spread of both 
punishment and reward. Spread of 
effect, with both punishment and re- 
ward, has been reported by several 
E’s (4). Of special interest here is 
Stone's finding, in the experiment 
referred to above (9), that punish- 
ment not only strengthened the con- 
nection to which it was applied but 
also strengthened nearby connections 
that were neither punished or re- 
warded. This result was interpreted 
as further evidence for the conclusion 
that punishment fixates slightly. 

In his experiment Stone used only 
two conditions, saying “wrong” after 
the responses to one of the stimuli for 
one group of Ss, and saying nothing 
after the responses to the same stimu- 
lus for another group. The fre- 
quency of response repetition in this 
latter group served as the baseline to 
which was compared the repetitions 
of the punished group. With this 
design it cannot be determined what 
would have been the action of other 
after-effects, such as reward. It 
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seems necessary, if one is operating on 
the assumption that punishment fix- 
ates slightly but reward fixates even 
more, to determine the action of re- 
ward within the same experimental 
design. Then, with identical condi- 
tions except for the kind of ‘effect’ 
administered, a more adequate test of 
Stone’s generalization can be made. 
Further, because neutral after-effects 
have also been reported to strengthen 
response (4), a control is necessary in 
which E says something after the S’s 
responses which is neither rewarding 
nor punishing. The present experi- 
ment was designed to permit direct 
comparisons of all of these conditions: 
reward, punishment, meaningless 


after-effect, and no after-effect. 


Il. Procepure 


The procedure was the same as that used by 
Stone (9). The stimuli were a list of 39 English 
three letter words. Some of the procedural de- 
tails may be gleaned from the instructions to the 
S, which were as follows: 


I am going to read to you, one by one, a list 
of English three letter words, words such as 
‘mud,’ ‘fan,’ etc. Each of these words has a 
number associated with it, that number being 
somewhere between one and 10. Your task is 
to find out and to learn which number is asso- 
ciated with each of the words. This is the way 
we will do it. I will say one of the words and 
you will respond with some number between 
one and 10 which you think is associated with 
the word. After your response I will do any 
one of four things. If the number you give is 
right, I will say “right.” If the number you 
say is wrong, I will say “wrong.” Sometimes 
after you have responded with a number I will 
say something meaningless, like “Bloonpret.” 
And sometimes I will say nothing at all. 
When I say something meaningless, or when I 
don’t say anything, you won't be able to tell 
whether the number you gave is right or wrong. 
Remember, I will say one of the words, you 
will give a number between one and 10 which 
you think is associated with the word, and then 
I will say “right,” or “wrong,” or something 
meaningless, or nothing at all. You are to 
try to learn as many of the word-number pairs 
as you can, to get as many right as possible. 
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Since the instructions were rather compli- 
cated, they were given with considerable empha- 
sis and judicious use of pauses. Each S was 
asked if he understood; if necessary, parts of the 
instructions were repeated. 

Actually, there were no correct numbers to be 
associated with the stimulus words. The after- 
effect for each stimulus was fixed regardless of the 
S’s response. 

Preceding the experiment the four after-effects 
were assigned at random to stimuli 1-16 and 
28-39 and these remained the same for all Ss. 
The remaining 11 stimuli, nos. 17-27, elicit the 
responses which constitute the primary data of 
the experiment. 

For all Ss, the first five stimuli of the 11, nos. 
17-21, and the last five, nos. 23-27, were without 
after-cffect; ic, E said nothing after the Ss’ 
responses to these stimuli. The Ss were divided 
into four groups on the basis of the after-effect 
given to the middle stimulus word, no. 22. In 
the Right group, £ always said “right” after the 
responses to this stimulus. In the Wrong group, 
E always said “wrong.” Similarly, in the Mean- 
ingless Word group, E always said some meaning- 
less word after the responses to stimulus no. 22, 
and in the No Response group, E said nothing at 
all. Hereafter, the four groups will be referred 
to by the after-effect given to stimulus no. 22. 
It is to be noted that the Wrong group and the 
No Response group duplicated the two condi- 
tions used by Stone. 

In the No Response group all 11 stimuli were 
without after-effect. This is the control group 
for the experiment; the percentage of response 
repetition given by this group will serve as an 
empirical baseline to which the other three 
groups may be compared. 

With this design the stimulus word which was 
no. 22 in the list of 39 stimuli, hereafter called the 
Key Stimulus, is isolated by being both preceded 
and followed by five stimul ponse connec- 
tions which had no after-effect. The responses 
to this isolated stimulus are, in different groups of 
Ss, followed by reward, punishment, meaningless 
after-effect, or no after-effect. The action of 
these different after-effects can then be compared 
under identical conditions. 

The Ss were 120 psychology undergraduates 
at Northwestern, equally divided as to sex. 
None had yet studied in their course work ma- 
terial pertinent to this experiment. The Ss were 
assigned at random to each condition, 30 in each 
group. 

E read the stimulus words to the Ss at a rate 
of one every three sec. There was no pause be- 
tween reading the stimulus, the S’s response, 
and £’s giving the after-effect. Then there was 
a slight pause before the next stimulus was given. 
Three extra Ss were obtained to replace three 
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whote records were discarded because of their 
long pauses between stimulus and response. 
Each S was given six trials through the list. 


III. Resutts 


The decision as to what should be 
called response repetitions is to a cer- 
tain extent arbitrary. Here it was 
decided to count as one repetition any 
response which occurred on at least 
two successive trials. With six trials, 
the maximum number of repetitions 
after any stimulus is five. In some 
cases the total repetitions counted for 
one stimulus word are not all repeti- 
tions of the same response. For ex- 
ample, an S might give and repeat one 
response in a pair of successive trials, 
then in a later pair give and repeat 
some other response, all to the same 
stimulus. Repetitions of both these 
responses count in the total. 

The primary data of the experiment 
are presented in Table I. The table 


shows the number of response repeti- 
tions made by each group. of Ss after 


each of the 11 stimulus words. Since 
there are 30 Ss in each group and a 
maximum number of five repetitions 
by one S to one stimulus, the total 
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possible repetitions to any one stimu- 
lus is 150. Table I also shows the 
number of repetitions to each stimulus 
converted into a percentage of 150. 

Table I shows only one fairly high 
percentage of response repetition, 
48.0 percent for the rewarded re- 
sponses; i.e., in the Right Group where 
E said “right” after the Key stimulus. 
This result shows more clearly in 
Fig. 1. 

Fig. 1 is derived from Table I. 
First, an empirical baseline of re- 
sponse repetition was obtained by 
taking the mean of the repetitions to 
the 11 stimuli by the No Response 
Group. This turned out to be 36 
repetitions, or 24.0 percent. Paren- 
thetically, it may be noted that one 
would predict only 10.0 percent rep- 
etition on the basis of chance. For 
responses followed by no after-effect, 
24.0 percent repetition is apparently 
a fairly reliable estimate. If one 
takes the 10 stimuli with no after- 
effect in the other groups of Ss, com- 
putes the mean repetitions, and con- 
verts these means to percentages, it is 
found that for any group or all groups 
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repetition. 


combined the percentages obtained 
vary only between 23.1 and 25.4. 
The next step in plotting Fig. 1 con- 
sisted in setting the 24.0 percent 
repetitions obtained with the No Re- 


sponse Group equal to zero. Then 
the percent repetitions at each stimu- 
lus for the other three groups were 
plotted as gains or losses from this 
baseline. 
nificance of the deviations from the 
baseline in Fig. 1 showed that only the 
point indicating the responses of the 
Right group to the Key Stimulus is a 
significant deviation, at P < .oor. 
Fig. 1 also shows no evidence for 
spread of effect, either from punish- 
mentor reward. Repetitions to stim- 
uli preceding or following either the 
rewarded or the punished connection 
show no consistent pattern of gains or 
losses. Further, the gains or losses 
that do occur do not conform to the 
spread of effect hypothesis; viz, that 


Chi-squares testing the sig-” 


The percent gain or loss in response repetition over an empirical baseline of 
See text for the method of constructing the figure. 


they should be greater the nearer the 
connections are to the rewarded or 
punished connection. 

In this experiment, then, reward 
considerably strengthened verbal 
stimulus-response connections, as has 
been found many times in previous 
experiments. Punishment did not 
weaken the connections to which it 
was applied, which confirms Thorn- 
dike’s view. Punishment also did not 
strengthen connections, and thus 
Stone’s hypothesis was not verified. 
The lack of any observable spread of 
effect fails to agree with the results of 
most previous investigators but is not 
the first time that spread has failed to 
appear (4). 

One difficulty in the use of response 
repetitions as the only measure of the 
influence of an after-effect is the all- 
or-none character of the measurement. 
An S either makes repetitions or he 
doesn’t. There are, however, con- 
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siderable differences in response pat- 
terns among those for whom no rep- 
etitions are counted. These differ- 
ences lie in the degree of variability of 
responses shown by the S. For ex- 
ample, one S may choose six different 
responses in his six trials, while an- 
other S uses only two responses but 
alternates them from trial to trial. 
For both of these Ss, there are no re- 
sponse repetitions but the latter S 
shows much less variability of re- 
sponse. Some measure of response 
variability, then, may prove revealing 
in this experiment, particularly in 
view of the hypothesis which holds 
that punishment increases response 
variability. 

A simple measure of response vari- 
ability is the number of different re- 
sponses given by an S during his six 
trials to a stimulus word. It is to be 
noted that this measure includes re- 
sponse repetition; high repetition will 
show up as fewer different responses. 
At the same time, Ss who never give 
the same response in successive trials 
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but who also use relatively few re- 
sponses will contribute to the data. 
The mean number of different re- 
sponses given by each of the four 
groups to the 11 stimuli is shown in 
Fig. 2. An analysis of variance 
among the four groups at each stimu- 
lus word shows that only the F at the 
Key Stimulus is significant, at P < 
.o1. When #-tests are employed, to 
test pairs of groups at the Key Stimu- 
lus, it is found that the Right group 
differs from each of the other three 
groups at better than the five percent 
confidence level. The other three 
groups do not differ significantly from 
each other. Although the greatest 
variability as measured by number of 
different responses did occur in the 
Wrong group, as is indicated by the 
highest point above the Key Stimulus 
in Fig. 2, this is not a statistically 
significant deviation. This is not 
surprising in view of the apparently 
random criss-crossing (according to 
the F-test) of the curves at the stimuli 
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The mean number of different responses made by each 


group to the 11 stimuli shown on the abscissa 
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preceding and following the Key 
Stimulus. 

Apparently, then, reward consider- 
ably narrows response choice; response 
variability is much reduced after 
saying “right.” With the relatively 
few trials given in this experiment, 
punishment, meaningless after-effect, 
and no after-effect neither reduce nor 
increase response variability. And 
again, in Fig. 2, there is no evidence 
of a spread of the influence of any of 
the four after-effects. 

When the data were tallied for 
number of different responses, it be- 
came obvious that another variable 
had exerted considerable influence; 
this variable is response favoritism. 
The possible significance of biases for 
certain responses for hypotheses about 
spread of effect will be considered 
later; here we shall simply present the 
evidence showing that definite prefer- 
ences didoccur. The number of times 
each of the 10 responses occurred on 
the first trial is probably the best 
measure of favoritism; the action of 
the various after-effects is presumably 
smallest at this stage of learning. 

Table II shows the frequency of oc- 
currence of each of the 10 responses 
for each group of Ss on the first trial 
for all 39 stimuli and the total for all 
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Ss. With each of the 120 Ss making 
39 responses the total number of re- 
sponses on the first trial is 4680. If 
there were no response favoritism, 
each of the 10 responses should have 
occurred 468 times. Table II shows 
that this was not the case. Testing 
the 10 deviations from 468 by a single 
chi-square indicated that the prob- 
ability that the over-all deviation oc- 
curred by chance is less than .oot. 
As Table II shows, by far the largest 
deviations from the theoretical fre- 
quency of 468 are contributed by re- 
sponses 4, 3, and 2, which occurred 
much more frequently, and by re- 
sponses 10 and 9, which occurred much 
less frequently, than would be ex- 
pected by chance alone. 

Finally, do response favoritisms 
continue to appear throughout the last 
five trials of the experiment; are the 
responses which are chosen more fre- 
quently on the first trial also repeated 
more frequently? The answer to this 
question appears in Table III in which 
is shown the number of times each of 
the 10 responses was repeated by each 
group and for all the Ss combined. 
The data in Table III were obtained 
by counting response repetitions after 
all 39 stimuli in the last five trials. 
Any one S, then, could make as many 
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as 195 repetitions, so for 120 Ss the 
total possible repetitions are 23,400. 
Actually, as the sum of the Total 
column in Table III shows, only 6585 
repetitions were made, or 28.14 per- 
cent. This is slightly higher than the 
24 percent repetition previously used 
as the empirical baseline. The dis- 
crepancy is probably due to the fact 
that responses to several of the 39 
stimuli had “right” as their after- 
effect, which materially increases the 
number of repetitions. 

The total repetitions made by each 
group, shown as the sums of the 
columns in Table III, indicate a 
slightly greater number of repetitions 
for the Right group and the fewest 
repetitions for the Wrong group. 
This is probably due to the one extra 


rewarded response and the one extra > 


punished response in these groups 
respectively. In any case, a chi- 
square test shows that the groups do 
not differ significantly from each other 
in total repetitions. 

The one significant result from 
Table III concerns the number of 
times that each of the 10 responses 
was repeated. An over-all chi-square 
testing the hypothesis that each re- 
sponse was repeated an equal number 
of times shows that the probability 
that this hypothesis is tenable is much 
less than .oo1r. It is to be noted that 
the frequency with which each re- 
sponse was repeated agrees fairly well 
with the frequency with which each 
response was initially chosen; com- 
pare the columns labeled Total in 
Tables II and III. 


IV. Discussion 


The principal result of this study is 
the lack of confirmation of Stone’s 
finding that punishment strengthens 


verbal connections. There seems to 
be no ready explanation for this differ- 
ence in results; as far as is known 
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Stone’s method was duplicated in 
every respect. However, we can 
afford to be cautious in attempting to 
reconcile differences in view of the 
present disagreement on the action of 
punishment. Postman (4) has rightly 
criticized some of the attempts that 
have been made to generalize the 
effects of punishment from one situ- 
ation toanother. There is a large gap 
between the announcement of 
“wrong” to a human S and the appli- 
cation of electric shock to a rat. 
Even if we stick to verbal connections 
and human Ss, there is no certainty 
that the £’s definition of punishment 
holds for the Ss as well. It is to 
Stone’s credit that he is aware that 
saying “wrong” may not belong to 
the class of events usually called 
punishments. He concludes, how- 
ever, that the word appeared to be an 
annoyer when it was applied and at 
the same time it had a strengthening 
effect (7). 

On intuitive grounds it seems para- 
doxical that an event should be an 
annoyer and yet strengthen the tend- 
ency to make the response which pro- 
duced the annoyance. Such a princi- 
ple seems to contradict the common 
belief in the adaptiveness of behavior. 
One suggestion from the present ex- 
periment may be pertinent here. It 
may have been a mistake to run the 
Ss only six trials. Reward seems to 
strengthen responses directly and to 
do so very early in learning. Ad- 
mittedly, punishment, or more cor- 
rectly, announcement of “wrong,” 
has no such clear-cut effects. We 
could hypothesize, then, that the 
action of punishment, whatever that 
action may be, does not occur, or at 
least is not measurable, until after 
some learning has taken place. Per- 
haps the Ss must be able to recall 
which responses were punished before 
they can deliberately vary their re- 
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sponses and eventually hit the correct 
one. Certainly in the present experi- 
ment the Ss could recall very few of 
the stimuli, and even less the responses 
made or whether those responses were 
punished. Possibly, then, before the 
Ss find out which connections are 
punished, punishment may have either 
no effect, as in this study, or even a 
slight strengthening effect, as in 
Stone’s work. Eventually, however, 
the wrong responses must give way to 
right ones if practice continues, and 
this may be due to being able to recall 
just which responses were called 
wrong. It will be noted that the 
above hypothesis is very similar to 
the Stephens-Tilton conclusion that 
punishment has little effect on weak 
associations but considerably weakens 
strong ones (6,11). ‘Thus, if learning 


had been carried turther than it was 
in this experiment, a more definite 
action of punishment might have been 


found. 

It is not surprising to find no spread 
of effect from punishment, since even 
the connection to whick. punishment 
was applied was neither weakened nor 
strengthened. But there was also no 
spread around the rewarded connec- 
tion, which was itself considerably 
strengthened by the reward. Two 
suggestions can be made concerning 
this result. First, as in the case of the 
ineffectiveness of punishment, there 
may have been too few trials for 
spread to appear; after six trials the 
rewarded connection was repeated 
only 24 percent more than the empiri- 
cal baseline of repetition. With more 
trials and consequently greater 
strengthening of the rewarded con- 
nection, a spread of the action of re- 
ward would be more likely to appear. 
Second, previous studies have con- 
sistently used punished connections on 
both sides of the rewarded one, 
whereas in this study the connections 
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preceding and following reward were 
without after-effect. Whether there 
is any significant difference between 
these two procedures is unknown. 

Finally, what about the definite 
response preferences which were shown 
to be operating from the very be- 
ginning of learning? Thorndike (10) 
once raised the question as to how 
response favoritisms might affect rep- 
etitions of rewarded responses as 
well as the spread of effect. In his 
attempts to determine a baseline from 
which to measure the strengthening 
action of reward he applied a correc- 
tion for the clear-cut biases in choice of 
response which he found in his Ss. 
He did not state, however, how he 
arrived at this correction, and appar- 
ently no subsequent investigators of 
spread of effect have even considered 
the problem. Jenkins and his associ- 
ates (1,2) have argued for the sig- 
nificance of response sequences and 
guessing habits among Ss, and in a 
recent study Jenkins and Cunningham 
(3) showed clearly that when the re- 
sponses to be chosen consisted of the 
numbers one to 10, the number S gave 
was very definitely related to the 
numbers previously used. Especi- 
ally was the response determined by 
the last number given. 

Jenkins and Cunningham present 
their number sequence data as evi- 
dence for their guessing-sequence hy- 
pothesis, which is used to explain 
gradients of effect. We would like to 
call attention to response preferences 
as another variable. With this vari- 
able in mind it would be necessary to 
analyze gradients of effect to deter- 
mine if punished responses close to a 
reward are actually responses which 
tend to be chosen, and therefore re- 
peated, more frequently simply be- 
cause they are preferred responses. 
Unfortunately, the data of the present 
experiment do not lend themselves to 
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such an analysis. First, of course, no 
gradients of effect were found, but the 
chief difficulty was that when repeti- 
tions were computed for each of the 
10 responses individually, using only 
connections which were near to a re- 
ward and were isolated, the number of 
repetitions were too small for reliabil- 
ity. There is no reason, however, 
why an experiment could not be de- 
signed specifically to measure the 
contribution of response favoritism to 
gradients of effect. If it were found 
that this variable exerts some influ- 
ence in this type of experiment, a more 
parsimonious interpretation of spread 
of effect would be in order. 


V. Summary 


1. Four groups of 30 Ss each were 
read a list of 39 words under instruc- 
tions to learn which number between 
one and 10 was associated with each 
word. After each response by S, E 
said “right,” or “wrong,” or some 
meaningless word, or nothing at all. 
With the exception noted below, the 
four after-effects were assigned at 
random to the stimuli and remained 
the same regardless of S’s responses. 

2. One of the stimuli in the approxi- 
mate middle of the list was isolated by 
being both preceded and followed by 
five stimuli which had no after-effect 
at any time for any of the Ss. Then 
the four groups of Ss were differenti- 
ated by the type of after-effect given 
to the isolated stimulus. 

3. The group of Ss with no after- 
effect to the middle 11 stimuli per- 
mitted the construction of an empiri- 
cal baseline of response repetition. 
The only significant deviation from 
this baseline occurred when the re- 
sponses to the isolated stimulus were 
rewarded; saying “right” produced a 
significant gain in response repetition. 
Punishing the responses to the isolated 
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stimulus by saying “wrong,” or giving 
a meaningless after-effect, produced 
neither a significant increase nor de- 
crease of repetition. Thus, the hy- 
pothesis that punishment slightly 
strengthens verbal connections was 
not confirmed. 

4. When repetitions of the responses 
made to the stimuli preceding and fol- 
lowing the isolated stimulus were com- 
puted, there was no evidence of a 
spread of effect from either reward or 
punishment. 

5. Another measure of the action of 
the consequences of a response was 
evolved; this was the number of 
different responses made to a stimulus, 
which was suggested as a measure of 
response variability. When this 
measure was applied to the data, the 
results duplicated those found for 
response repetition; there was a sig- 
nificant reduction in response vari- 
ability only in those responses directly 
rewarded. 

6. It was shown that definite re- 
sponse preferences operated even on 
the first trial and that the responses 
more frequently chosen initially were 
also more frequently repeated 
throughout learning. It was sug- 
gested that this finding may have a 
bearing on attempts to account for the 
spread of effect. 


(Manuscript received June 13, 1949) 
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PSYCHOMOTOR PERFORMANCE AND DISTRIBUTION 
OF PRACTICE * 


BY ARTHUR J. RIOPELLE 
University of Wisconsin 


In this experiment studies have been 
made of the specific quantitative 
effects of the conditions of relative 
massed and distributed practice on 
psychomotor performance. The pri- 
mary objectives of the study have been 
to assess the relationship between 
massed and spaced practice, and: (a) 
level of performance; (b) individual 
differences in performance; (c) incre- 
ments in performance during and be- 
tween test periods; (d) reliability of 
performance scores; (e) predictability 
of final performance from _ initial 
ability; and (f) the relationship be- 
tween initial performance and overall 
gain during the learning. 

Most studies on distribution of 
practice have been concerned with 


differences in level of performance as- 
sociated with rest periods between 


learning trials. Recent experimental 
work, however, has been pointed to- 
ward aspects of the problem other 
than overall level of performance (2, 
8, 10,12). Although superiority of 
distribution of practice has been gener- 
ally found, the advantage is by no 
means universal (3, 4,6). The supe- 
riority of one degree of distribution of 
practice over another is apparently 
dependent upon the nature of the 
task. Numerous problems, especi- 
ally with reliability of performance in 
training, have yet to be examined in 
regard to spacing and distribution of 
practice periods in learning. 


MeTHOoD 


Apparatus.—The apparatus employed to 
make the present study was the Vector Complex 


' The author wishes to express his appreciation 
to Dr. Karl U. Smith for helpful counsel in the 
conduct of this study. 


Reaction Time test.2 The test attempts to 
measure the proficiency with which persons re- 
spond to rapidly changing stimulus patterns, the 
relationships of which, in themselves, are readily 
understood. The basic relationship to be 
learned consists primarily of a ‘left of,’ ‘right of,’ 
‘above,’ or ‘below’ relationship. These con- 
figural patterns are set up on a panel by means of 
a given arrangement of colored lights, correct 
response to which sets up a new light pattern. 
The subject is presented with an L-shaped in- 
strument (Fig. 1) displaying a vertical stimulus 
panel on which are shown four groups of five 
white lights each arranged in a pentagonal pat- 
tern. Within each bank of five lights are located 
a red and a green light. Four banks of five 
switches are located on the horizontal response 
panel. These are arranged in a diamond pat- 
tern, while the groups of lights mark the four 
corners of a square. When a pattern composed 
of a red, a green and a white light is presented to 
the S, the problem is solved by turning a particu- 
lar switch on the response panel. The deter- 
mination of the correct switches depends upon 
the relation of the red to the green light. For 
example, if the glowing red light is above the 
green light, the switch to be turned is located in 
the upper bank of switches. The specific switch 
to be turned corresponds in position with that 
of the glowing white light. Thus, in the example 
given, if the uppermost white light is glowing, 
the correct switch is the uppermost position of 
the upper bank of switches. The glowing white 
light is always in the same bank of lights as the 
glowing red light. After 40 correct responses 
are made, the series of stimulus patterns is 
repeated. The score on the test is the number of 
correct switches turned in a testing period. 
Subjects and procedure.—Forty-three Ss were 
used in this investigation. All were male 
students drawn from the class in elementary psy- 
chology at the University of Wisconsin. 
Twenty Ss were tested under the conditions of 
massed practice and 23 under spaced practice. 
A total of 40 two-min. trials were given to each 
S. The spaced-practice group received four 
trials per day for a period of 10 days, while the 
massed-practice group received 40 consecutive 
trials all on one day. For ease of comparison 
each group of four trials constituted a ‘test 


* Available from Vector Manufacturing Co., 
5615 Lawndale, Houston 3, Texas. 


390 








DISTRIBUTION OF PRACTICE 


Fic. 1. 


period.” Each trial lasted two min., and the 
trials were separated by a 55-sec. rest and a five- 
sec. warning period. The warning consisted of 
the ringing of a bell for five sec. The Ss were 
not permitted to know the results of their per- 
formances until after the completion of the 40 
trials. All Ss were tested individually. 


RESULTS 


Level of performance and individual 
differences—The mean numbers of 
correct reactions per test period are 


presented in Fig. 2. Inspection of 
this figure reveals that for both groups 
the level of performance increased 
throughout the experiment, and that 
the curve for distributed practice is 
consistently above that for massed 
practice. These data also show that 
the differences between the two groups 
increase with further practice through- 
out the range of the present study. 
The trends in performance for the two 


The Vector Complex Reaction Time apparatus 


groups are highly significant (P < 
.001) when evaluated by the Alex- 
ander (1) trend test. 

The present results show the supe- 
riority of distributed practice over 
massed practice conditions in learning. 
This difference may be accounted for 
in terms of known phenomena them- 
selves, e.g. the failure of the massed 
practice conditions to permit the 
operation of between-trial or inertia- 
like learning effects in performances in 
which such effects are known to occur. 
The generally increasing difference 
between the two groups also may be 
said to be in line with theories postu- 
lating such factors as work decrement 
(9) which accumulates most rapidly 
under massed practice. These data 
are similar to those of Kientzle (ro) 
who, from constants derived from 
fitted curves, found that the curves for 
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groups with longer rest intervals rose 
slowly to high maxima while those 
with shorter rest pauses rose relatively 
quickly to low maximum mean scores. 
Similar results were obtained by 
Nance (12). Lorge’s (11) prediction 
that “. . . observed superiority in 


favor of a distributed practice group © 


will become less [with practice ] until 
eventually such superiority becomes 
zero”’ is not supported. 

The magnitude of the individual 
differences (Fig. 2), as measured by 
the o, increased throughout training. 
The o’s showed a gradual increase for 
both groups, but at any stage of train- 
ing were essentially equal for the two 
conditions, as none of the F-values 
was statistically significant. The 


curves of Fig. 2 indicate that for both 
conditions of spacing, the individual 
differences in performance first in- 
crease, then level off and finally in- 
crease again. 


The relative variability 
for each condition decreased, but that 
for distributed practice decreased at a 
faster rate since level of performance 
for that group increased more rapidly. 
It is a frequent finding that group 


---* 
—_ 


a= ~ 
an” watmeutCe 


—— 


LEVEL OF PERFORMANCE 


a) 





7.20 


orsTeoutee _- 
=—t--<-* 


ae 


NUMBER OF REACTIONS 


STANDARD OEVIATION 





« 8s 6 tT 6 98 #8 
SUCCESSIVE TEST PERIODS 


Fic. 2. Differences between massed and 
distributed practice in psychomotor perform- 
ance: mean number of correct reactions and 
standard deviations at successive test periods 
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variability increases throughout learn- 
ing while relative variability decreases. 
Ellis (5) has recently reviewed the 
literature on relative variability and 
indicates a mechanism for changes in 
variability based on the addition of 
variables. Kientzle (10) found vari- 
ability to increase more rapidly with 
distribution of practice than with 
massing of practice among subjects 
printing the inverted alphabet. She 
found no change in variability under 
massed conditions. Bell (2) and Hil- 
gard and Smith (8) report an increase 
in variability with added trials. The 
relationship between mean and stand- 
ard deviation did not vary systemati- 
cally as a function of the degree of dis- 
tribution of practice. Their studies, 
like this one, did not make use of 
absolute massing. The essentially 
comparable increases in variability as 
a function of practice with the two 
distributions of practice employed in 
the present study when related to the 
above studies suggest that the exist- 
ing relationship depends upon the 
specific task. 

Intra- and inter-day gains (distrib- 
uted practice)—For the distributed 
practice group it is fruitful to compare 
the gains within test periods (intra- 
day gains) with the overnight incre- 
ments in performance (inter-day 
gains.) Since each test period con- 
sisted of four trials, the differences be- 
tween the fourth trial of one test 
period and the first trial of the follow- 
ing test period were used as a measure 
of the between-day gains. The intra- 
day gain was computed as the differ- 
ence between the first and fourth trial 
score on a given day. There is a 
necessary correlation between these 
two measures due to unreliability of 
the measurement, but with high reli- 
abilities the correlation is minimal. 
This follows from the fact that the 
fourth trial score partly determines 
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Fic. 3. Intra- and inter-day gains in per- 
formance under distributed practice 


both the within-day and between-day 
gains.’ 

The intra- and inter-day gains are 
plotted in Fig. 3. From this figure it 


will be seen that the within-day gains 
for distributed practice are typically 
greater than the between-day gains. 
The absolute amount of superiority of 


intra-day gains over inter-day gains 
is, in part, a function of the lengths of 
the two periods. The general shapes 
of the two curves are similar to the 
extent that both show an initial high 
improvement in performance and 
lesser gains later in learning. Im- 
provement in performance resulting 
directly from practice and from inter- 
polated rest intervals is greatest in the 
initial stages of learning. The high 
initial overnight gain and the low 
later gains is related to the finding by 
Hilgard and Smith (8) of positive 
overnight gains early in practice and 
negative gains later in practice. 

The consistently positive within- 
day gain probably indicates that some- 
thing similar to a ‘warm-up’ effect is 
operating. It is interesting to note, 
however, that a plot of the within-test 


* If the fourth trial score is high by chance, 
the within-day gain is inflated and the between- 
day gain is suppressed. The resulting correla- 
tion will be negative. 


period gains for the massed practice 
group would yield a curve running the 
same general course as that for dis- 
tributed practice. 

Reliability—Test-retest _ reliability 
at all stages of learning for the two 
groups was consistently high. Total 
scores on the first test period corre- 
lated +.86 with those on the second 
test period for each group. Correla- 
tions between subsequent pairs of test 
periods for both groups ranged from 
+.91 to +.97. 

Predictability of later performance 
from initial performance.—Correla- 
tions between initial level and per- 
formance at specific stages later in 
training were computed for both 
groups. These correlations are 
plotted in Fig. 4. The trends of the 
correlations indicate that initial per- 
formance predicts later periods of per- 
formance less well as the final stages 
of practice are reached. As indicated 
by the above reliability measures, 
scores on test period 1 correlated +.86 
with those on test period 2. These 
correlations decrease as one attempts 
to predict later and later performance 
from initial performance. It appears 
that predictability under the distrib- 
uted practice condition is about as 
good as it is under the massed practice 
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Fic. 4. Differences between massed and dis- 
tributed practice in psychomotor performance: 
change in level of correlation between initial and 
successively later test periods 
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condition, especially until the eighth 
test period. Subsequent to this pe- 
riod the differences between the cor- 
relations increase. The correlation 
between the first and tenth test pe- 
riods is +.63 for distributed practice 
and +.37 for massed practice. Be- 
cause of the small number of cases, 
however, a difference this large fails 
to attain the usually required level of 
statistical significance. 

Kientzle (10) has reported that 
correlations between pairs of trials 
decrease with increasing separation, 
but that these correlations were not 
affected by differing degrees of dis- 
tribution of practice. The number of 
trials given in that experiment was 
markedly reduced for the long rest 
interval groups; hence her results 
cannot be considered as data contrary 
to those of this experiment. 

Relation between initial level and 
gains.—So that an evaluation may be 
made of the relation existing between 
initial level and overall gain, these two 
factors were correlated for the two 
groups separately. Overall gains 
were computed as the difference be- 
tween the first trial score of the first 
test period and the fourth trial score 
of the tenth test period. When ac- 
count is taken of the reliability of the 
scores (7), the resulting coefficient was 
+.31 for the distributed practice 
group and +.03 for the massed prac- 
tice group. Again, because of the 
small sample employed, a correlation 
of +.31 is not statistically significant. 


SUMMARY AND CONCLUSIONS 


The psychomotor learning of two 
groups of college male sophomores was 
investigated in terms of performance 
on the Vector Complex Reaction Time 
test under the conditions of massed 
(N = 20) and distributed (N = 23) 


practice. The massed group received 
40 trials in one day while the distrib- 
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uted practice group received four 
trials per day for 10 days. The re- 
sults of this investigation indicate 
that the following conclusions may be 
drawn. 

Throughout training the level of 
performance increased for both groups. 
The level of skill of the distributed 
practice group was consistently supe- 
rior to that of the massed practice 
group, and the difference between the 
two groups increased as learning 
progressed. Individual differences 
within each group increased during 
training by the same amount. 

Inter- and intra-day gains for dis- 
tributed practice were greatest early 
in training. Both gains were greatest 
in the initial stages of learning. Al- 
though gains were smaller later in 
training, they were, nevertheless, still 
positive. Intra-day gains were typi- 
cally greater than inter-day gains. 
This was due, in part, to the length of 
the test period. 

Consistency of individual perform- 
ance, as indicated by correlations 
between scores on successive test 
periods, was consistently high in both 
groups at all stages of training. 

A major problem of this study has 
been the evaluation of certain aspects 
of prediction in this learning task. 
Initial level of skill was found to 
predict later level with decreasing 
efficiency as the number of intervening 
test periods increased. Whereas the 
correlation between scores on the 
initial test periods was +.86, the cor- 
relation between the first and tenth 
test period scores was +-.37 for massed 
practice and +.63 for distributed 
practice. The decrease in correlation 
between initial and later scores in 
spite of the high reliability of the 
measures indicates that factors, not 
yet identified, enter into the deter- 
mination of the individual scores; and 
that these factors result in a gradual 
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rearrangement of the subjects as to 
level of skill. The above mentioned 
correlations suggest that these factors 
play a greater role under conditions 
of massed practice than they do under 
distributed practice conditions. 

The difference in predictive effi- 
ciency of initial scores in the massed 
and distributed practice conditions, as 
just summarized, is found to corre- 
spond with the efficiency of the initial 
scores in these conditions in predicting 
overall gain. For massed practice 
initial scores show a correlation of 
+.03 with gain, for distributed prac- 
tice the correlation is +.31. The 
relative correspondence between these 
values and the values found for the 
correlation of initial and final scores 
suggests that a consistent effect occurs 
under the two conditions as far as 
predictive relations are concerned, 
although the differences between cor- 
relations in the two conditions do not 
fulfill a rigid criterion of statistical 
significance. 

The results described, particularly 
with reference to the systematic re- 
arrangement of the subjects which re- 
sults from practice, and possibly the 
differences in rate of rearrangement 
as a function of the distribution of 
practice, point up aspects of learning 
that need to be considered in adequate 
theories of learning. 


(Manuscript received May 26, 1949) 
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A COMPENSATORY EFFECT IN VOCAL RESPONSES 
TO STIMULI OF LOW INTENSITY’ 


BY JOHN W. BLACK? 
Kenyon College 


A series of studies has shown that in 
responding to messages talkers tend 
to speak more intensely as the level of 
the stimulus is increased. Increments 
of amplification in these instances 
were 16-20 db (1). Invariably with 
the lowest level of stimulus approach- 
ing the threshold of intelligibility 
there was no significant difference be- 
tween the mean intensities of the re- 
sponses to the two least intense levels, 
and typically the mean response to 
the second level was numerically less 
than the mean response to the lower 
one. 

The possibility arises that in a 
compensatory manner the responses 
to just-perceptible stimuli are more 
intense than to ones heard slightly 
more clearly, an exception to the gen- 
eralization that messages are said 
back with an intensity that increases 
with higher levels of stimuli. The 
study reported here treated this 
possibility. 

PROCEDURE 

Five lists of eight words each were recorded 
by one voice. In each list the last four words 
were antonyms of the first four, for examples, 
full, narrow, sister, many, empty, wide, brother, 
ew. 

: Twenty-five male Ss heard the five lists twice 


as they were played back through earphones. 
S heard the instructions: 


You will hear lists of eight words each. As 
soon as you hear a word please say it. Talk 
naturally. We shall now practice. ‘pad’ 

. . ‘table.’ That’s right. Talk naturally. 
Say only what you hear. A word will not be 
given until you have answered the preceding 
one. There is no rush. Now you will hear 


1 Work conducted at the School of Aviation 
Medicine, N.A.S., Pensacola, Florida, for U.S.N. 
Special Devices Center under Contract N7onr- 
411 with the U.S.N. Office of Naval Research 
and Kenyon College. 

* Now at The Ohio State University. 


the words. Watch the monitor for further 
instructions. 
Also he heard similar instructions with ‘say the 
opposite of the word’ instead of ‘say it,’ and ap- 
propriate changes in the context. 

After the directions were heard once a monitor 
held up a card indicating repeat or opposite before 
succeeding lists were presented. 

S and a monitor were in a sound-treated room. 
S was positioned with two microphones eight in. 
from his lips. One microphone led through an 
amplifier to a graphic level recorder (Sound Ap- 
paratus Co.); the other to a sound level meter 
(General Radio Co.). The playback and level 


recorder were in an adjacent room. The moni- 


tor read the deflections of the sound level meter 
in the room with S, although out of his view. 

The order of lists was constant throughout the 
experiment. The order of experimental condi- 
tions (intensity of play back and repeat vs. op- 
posite) was counterbalanced. 

Prior to hearing an experimental list, S heard 
successive lists of 12 words at low levels and re- 
peated the items. The lists were equally intel- 
ligible, and were recorded under the same condi- 
tions as the experimental lists. The level at 
which S repeated half a list correctly was arbi- 
trarily designated threshold. The lowest experi- 
mental intensity (level 1) was five db above 


TABLE I 


Summary or ANALYsIs oF VARIANCE OF 
Intensity or Orat Responses 
(Repetitions anp Oppostres) 
to Strwucus Worps aT Five 

Levets or INTENSITY 








Source of 
Variation 


Sums of 
Squares 


Mean 

Square 
Intensity (1) 
Types (T) 
Subjects (S) 


IxXT 
Ixs 
SxT 
SxTxI 


Total 





143.07 
7-33 
18948.24 


8.35 
1229.70 
210.74 
466.89 


21014.32 


35-77 
7-33 
789.51 
2.09 
12.81 
8.78 


4.86 

















* Significant at 5% level of confidence. 
** Significant at 1% level of confidence. 
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TABLE Il 
Retatirve Mean Intensity or Responses to Five Levers or Inrensrry or Stiwuut.' as Ss. 





Level of Stimuli 





I 
(Threshold +5 db) 


2 
(Lev. 1 +6 db) 


3 4 3 
(Lev. 2-+6 db) | (Lev. 3+6 db) | (Lev. 446 db) 





Relative intensity of 
responses (db)* 2.4 2.0 








7 o** 














* +, any difference between means of 1.6, significant at 1%; 1.3, 5%. 
** o db on this scale of relative measurement equals 69 db on the General Radio meter. 


threshold. The remaining four conditions were 
in 6-db increments from level 1. 

An analysis of variance was made of the data, 
treating intensity levels, types of messages, and 
Ss as the main variables. The mean intensity 
of S’s eight responses in each experimental condi- 
tion was used as the basic measure. 


RESULTS 


One analysis of variance is sum- 
marized in Table I. The data on 
which this analysis was performed 
were taken from the graphic level 
recorder.’ The analysis gave no basis 
for believing that repeat-opposite 
messages were said at different inten- 
sities. The ratio of the variances 
attributable to the Ss X intensity 
interaction and triple interaction was 
highly significant, indicating vari- 
ability among the Ss in their responses 
to the experimental conditions of 
intensity. With the interaction value 
(S x I) as error, the variance estimate 
attributable to intensity was signifi- 
cant (F = 2.79; 5% = 2.46; 4 and 
96 df). 

Table IJ lists the relative mean 
values of vocal intensity of the re- 
sponses to each of the five experi- 
mental conditions of intensity, the 
lowest measure being arbitrarily as- 
signed a-value of zero. ‘The range of 
mean values of responses decreased as 
the stimulus levels increased through a 
range of 18 db. The three lowest 
levels of stimulus elicited mean re- 


® The set of data from the sound level meter 
yielded comparable results. 


sponses that were highly significantly 
more intense than the mean of the 
responses to level 4. 


SuMMARY AND CONCLUSIONS 


With gross increments (16-20 db) 
in the intensity of stimuli, oral re- 
sponses, whether repetitions or orig- 
inal speech, increase in intensity. 
The pattern is not linear, and is re- 
ported only with reference to levels of 
intensity ranging upward from ap- 
proximately 30 db above threshold. 
In these earlier studies there has been 
the possibility that with very soft 
stimuli more intense speech might 
occur in response to faint rather than 
in response to somewhat more intense 
stimuli. The present study indicates 
the probability that this is the case. 
The compensatory adjustment in 
intensity to barely intelligible words 
(stimuli) is less dramatic in extent 
than the opposite effect with stimuli 
of greater intensities, namely in- 
creased intensity in vocal output ac- 
companying the stimuli of greater 
intensity. Nevertheless, apparently 
there is a real compensation and in- 
tensities of responses increase as the 
verbal stimulus approaches the thresh- 
old of hearing in intensity. 


(Manuscript received June 2, 1949) 


REFERENCE 


1. Brack, J. W. Loudness of speaking: The 
effect of heard stimuli on spoken re- 
sponses. J. exp. Psychol., 1949, 39, 
311-315. 





satnaestias, Hekate IOSE 


ge PIS ee EMIT Y FE 





THE DESTRUCTION OF THE MULLER-LYER 
ILLUSION IN REPEATED TRIALS: II. 
SATIATION PATTERNS AND 
MEMORY TRACES 


BY WOLFGANG KOHLER AND JULIA FISHBACK 
Swarthmore College 


In the first part of our report (7) we 
have shown that the destruction of the 
Miller-Lyer illusion in repeated trials 
is probably caused by satiation rather 
than by learning. We will now dis- 
cuss observations which seem to re- 
late satiation to certain facts in the 
field of memory. 

Both Judd (3,4) and Lewis (9) 
found that under the conditions of 
their experiments the destruction of 
the M.-L. illusion was a very slow 
process. But they also observed that 
the development could not be acceler- 
ated by a schedule of many trials 
On the 


given in a rapid sequence. 
contrary, after unavoidable interrup- 
tions of the experimental program the 
illusion often decreased with greater 
speed. Judd therefore gave one of 
his Ss fewer trials in a session, and also 
very long rest periods between ses- 


sions. As a result, the total number 
of trials needed for the destruction of 
the illusion was considerably smaller 
than it had been with other Ss. One 
of Lewis’ Ss showed practically no 
decrease of the illusion until, after 10 
days of experimentation, days of rest 
were given between experimental 
days; after this change the illusion 
could be destroyed without any 
difficulty. 

It will hardly be assumed that the 
factors responsible for the destruction 
operate with increased force in the 
absence of the pattern. Hence, such 
facts clearly indicate that long se- 
quences of trials establish an obstacle 
to the very development which they 


are supposed to bring about, and that 
this obstacle tends to disappear during 
rest periods of sufficient length. Our 
own observations agree entirely with 
this thesis. In fact, it was precisely 
findings of this kind which caused us 
to vary our schedule of experimenta- 
tion quite often, and sometimes radi- 
cally. As a result, the impeding in- 
fluence of crowded trials and the 
favorable effect of rest periods became 
obvious throughout our work, partic- 
ularly since we derived the temporary 
size of the illusion from small groups 
of individual measurements, and could 
therefore follow developments also in 
detail. It will suffice if we give a few 
typical examples. 

On her first day of experimentation, 
Subject D was given 60 individual 
trials, the results of which are shown 
in the curve of Fig. 1 as 14 successive 
averages. Early in the sequence, the 
illusion seems to decrease; but then its 
size grows again, and fluctuates about 
a somewhat higher level, so that little 
seems to be gained at the end of the 
day. On the next day, however, the 
illusion is at once greatly reduced, 
and presently it decreases still further 
under a regime of only a few trials in 
a session. Clearly, during the first 
day more has been achieved than ap- 
pears at the time in our measurements. 
It seems indeed that a sequence of 
many trials establishes some condition 
which obscures the actual achieve- 
ment, that this condition largely dis- 
appears from the first day to the next, 
and that, as a consequence, the size of 
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Fic. 1. Results during the first experimental 
days of Subject D. Broken lines indicate tran- 
sition from one day to another. Ordinate: illu- 
sion in percent. Abscissa: successive groups of 


four trials. 


the illusion appears to drop over 
night. In the curve of Subject A 


(Fig. 2) we find two such accelerations 
from one series of trials to the next. 
In this case, rest periods of two days 


were given between successive ses- 
sions. Incidentally, the second ac- 
celeration occurs when a negative 
illusion has already been extablished. 
This fact tends to show that the proc- 
esses which operate during this stage 
of the process are essentially the same 
as those which make the positive illu- 
sion decrease. 

The obstacle which develops when 
many trials are given without rest 
periods does not merely delay the 
process by which the illusion is de- 
stroyed. Often, its disturbing effect 
becomes so strong that for a while the 
direction of the development is actu- 
ally reversed. Instances of this phe- 
nomenon are shown in the curves of 
Subjects E and C (Figs. 3 and 4) which 
give the results of the first three ex- 
perimental days of Subject E, and of 
the first two days of Subject C. 
According to these curves, the size of 
the illusion tends to be greater toward 
the end of a day’s triale than it is in 
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some earlier parts of the session. 
Many further examples could be 
given; with most of our Ss the direc- 
tion of the process was temporarily 
reversed in this fashion. As we have 
just seen, this may already happen 
during the first days of experimenta- 
tion; but the same tendency is often 
greatly enhanced in later trials. For 
this reason, we gave some Ss only a 
few trials each day when many experi- 
ments had already been done before. 
With larger groups of trials, the de- 
velopment always reversed its direc- 
tion. Actually, the effect of many 
experiments in an immediate sequence 
is so strong that, when the illusion has 
been destroyed, it may be brought 
back to life again merely by giving 
long series of further experiments. 
We have done this once with Subject 
C. It seems, however, that Ss differ 
in this respect. In the case of Sub- 
ject B, the direction of the develop- 
ment was hardly ever convincingly 
reversed. 

Obviously, such observations re- 
semble well-known facts in the field 
of learning, namely, the inhibitions 
which make it difficult to memorize 
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Fic. 2. Results during the first experi- 
mental days of Subject A. Broken lines indi- 
cate transition from one day to another. Ordi- 
nate: illusion in percent. Abscissa: successive 
groups of four trials. 
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Fic. 3. 
from one day to another. 


monotonous series of items, or to estab- 
lish a precise motor performance in 
often-repeated practice trials. In the 
case of pursuit skills, the similarity 
goes so far that when at some point 
the crowding of trials is increased the 
number of errors actually grows, a 
reversal just like the one often ob- 
served in our experiments. Some 
curves which Renshaw and Schwarz- 
bek (10) have given in order to demon- 
strate this phenomenon might just as 
well represent observations of our own. 
Again both our experiments and those 
on learning when similar activities 
are often repeated exhibit the counter- 
part of such disturbances: after rest 
periods, results tend to be greatly and 
fairly abruptly improved. Not only 
the obstacle which delays destruction 
of the M.-L. illusion but also the in- 
hibitions which accompany some 
forms of learning are therefore to a 
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Results during the first experimental days of Subject E. Broken lines indicate transition 
Ordinate: illusion in percent. 


Abscissa: successive groups of four trials. 


degree reversible. Reminiscence as a 
certain improvement of recall when 
tests are not given immediately is, of 
course, merely a special form of the 
same fact. 

We do not believe that this agree- 
ment between our observations and 
those in the field of learning is merely 
accidental. But if it is not accidental, 
are we not forced to admit that, after 
all, the ‘practice’ effect deserves its 
name, i.e., that the M.-L. illusion is 
actually destroyed by learning? In 
what other fashion could the agree- 
ment be explained? 

We cannot accept this argument. 
Whatever else may follow from the 
agreement, it can hardly mean that 
the destruction of the M.-L. illusion 
in many trials is a matter of learning. 
Too many facts argue against this 
thesis. To repeat, the illusion has 
often been destroyed when Ss knew 
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Fic. 4. Results during the first experimental 
days of Subject C. Broken lines indicate tran- 
sition from one day to another. Ordinate: 
illusion in percent. Abscissa: successive groups 
of four trials. 


neither that they were dealing with an 
illusion pattern nor how their measure- 
ments compared with the objective 


situation. Moreover, according to 
Lewis the process is not accelerated 
when Ss are informed that they are 
confronted with an illusion. In our 
own experiments we have shown that 
the illusion can also be destroyed when 
actual comparisons are largely re- 
placed by mere fixation periods, dur- 
ing which Ss cannot learn to compare 
better. When, after the disappear- 
ance of the illusion, experiments are 
continued, a negative illusion can be 
established. When the illusion has 
been destroyed under the conditions 
of our experiments, another pattern, 
from which the main characteristics 
of the M.-L. figure are absent, but 
which coincides with the illusion pat- 
tern, proves to be changed in the 
same direction. The effect which re- 
peated trials have on the M.-L. illu- 


sion is to a high degree localized. 
While all these observations seem to 
be incompatible with the practice 
theory, they are readily understand- 
able if it is satiation which destroys the 
illusion. Moreover, any observation 
for which an explanation in terms of 
learning is possible has so far been 
found to be explainable also in terms 
of satiation. Under the circum- 
stances, it seems advisable to postpone 
discussion of the agreement between 
obstacles to the destruction of the 
illusion and inhibitions observed in 
certain forms of learning until we have 
seen whether perhaps those obstacles, 
too, can be derived from the known 
behavior of satiation. 

When explaining the fact that, after 
the disappearance of the illusion in a 
given position, it tends to be more 
rapidly destroyed in a new position of 
the pattern, we made use of a simple 
principle: once strong satiation has 
been established in certain areas, 
satiation by another pattern shown in 
the same region must be accelerated in 
areas which have so far been less 
affected. The principle is a direct 
consequence of the one by which 
figural after-effects have been ex- 
plained. If it is true that such an 
effect is observed when the current of 
the test object is weakened in highly 
satiated areas, and intensified in 
others, then upon prolonged presenta- 
tion of the test object the satiating 
effect of its current must be acceler- 
ated in these other areas. But this 
principle applies not only when, after 
satiation by a first pattern, a new pat- 
tern is introduced, and now develops 
further satiation. The same reason- 
ing holds also for continued or often 
repeated satiation by a given pattern. 
In the M.-L. figure, for instance, cur- 
rents issue from the lines of the angles, 
and spread in their environment. 
These currents are more intense in the 
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interior of the angles than they are on 
the other side of the lines. Conse- 
quently, satiation develops faster 
within the angles than it does in the 
areas toward which their corners 
point, whereupon the currents are 
deflected into the latter areas, and 
the illusion begins to decrease. But 
suppose that the pattern is inspected 
for a very long time, or in a great 
many crowded trials. The more the 
current is deflected from the interior 
of the angles and into the areas on the 
other side of the corners, the stronger 
must satiation now become in these 
areas also. This means that we must 
distinguish between satiation in the 
‘right’ places, i.e., those in which it 
must grow if the illusion is to be de- 
stroyed, and satiation in the ‘wrong’ 
places, i.e., those in which it now also 
grows, precisely because the others 
have already been strongly satiated. 
Thus, gradual satiation in the ‘wrong’ 
places is a natural consequence of very 
strong satiation in the ‘right’ places. 
We call the former ‘wrong’ because 
satiation of these places obviously 
tends to prevent further transfer of 
currents in their direction, and hence 
also a further decrease of the illusion. 
There is no reason why the process 
should not eventually be stopped in 
this fashion. In fact, when a stage 
is reached at which satiation in the 
‘wrong’ places grows faster than sati- 
ation in the ‘right’ places, the change 
in the size of the illusion must even 
reverse its direction. We may add 
that the pattern will be particularly 
sensitive to this effect of crowded 
trials when the number of preceding 
trials is already very great—a predic- 
tion with which our observations also 
agree.! 


1 Obviously, the present argument must be 
considered when the influence of fixation upon 
the destruction of the illusion is being discussed. 
We found that, to a degree, fixation is likely to 
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The obstacle which tends to prevent 
destruction of the illusion has thus 
been derived from the theory of sati- 
ation. No new assumptions have 
been introduced for the purpose. It 
remains to be shown that the obstacle 
must tend to disappear when long rest 
periods are given. This fact, again, 
follows from the behavior of satiation 
in general. When satiation is weak, 
figural after-effects can be demon- 
strated only for very short periods. 
Obviously, weak satiation soon sinks 
below the level at which it can affect 
a test object. On the other hand, 
more thorough satiation by a given 
pattern, particularly when it has been 
established in many inspection pe- 
riods, remains effective for surpris- 
ingly long periods. Now, satiation in 
the ‘right’ places is caused by more 
intense currents, and begins to de- 
velop earlier, than satiation in the 
‘wrong’ places. In fact, while the 
rate of further satiation in the ‘right’ 
and the ‘wrong’ places may tempor- 
arily be reversed, there is no reason 
why the absolute level of satiation in 
the ‘wrong’ places should ever ap- 
proach the level in the ‘right’ places. 
It follows that during a long rest 
period satiation in the ‘wrong’ places 
can gradually decrease, and that 
therefore the obstacle can largely dis- 
appear, while satiation in the more 
intensely affected ‘right’ places re- 
mains very strong. If this happens, 
first measurements after the rest pe- 
riod are likely to show an abrupt re- 
duction of the size of the illusion, be- 
cause now the currents of the pattern 
can freely turn in the required di- 
rection. This is the fact which Judd 


accelerate the process, because it accelerates 
satiation in the ‘right’ places. We must now 
add, however, that precisely for this reason 
fixation will soon accelerate satiation in the 
‘wrong’ places also, so that, in the long run, it 
will have both an impeding and a favorable 
influence. 
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and Lewis so often observed, and 
which we could confirm in our experi- 
ments. It has now also been derived 


from our general premises. 


Satiation in ‘wrong’ places cannot be 
restricted to the particular case of the 
M.-L. figure. Similar obstacles are 
bound to arise when other patterns are 
repeatedly shown for long satiation 
periods, and when therefore satiation be- 
gins to spread into ‘wrong’ areas. But 
once satiation grows to comparatively 
high levels in these areas, the size of 
corresponding figural after-effects is likely 
to be reduced. One must therefore ex- 
pect that figural after-effects will not in- 
variably be increased when very long 
inspection periods are given repeatedly, 
and when at the same time the satiating 
currents are intense. This is a further 
conclusion which can be tested when 
patterns involve no illusions. It hap- 
pens that results of a recent investigation 
refer directly to the present issue. Free- 
burne and Hamilton (1) experimented 
with certain figural after-effects when the 
brightness difference between the pat- 
terns and their background was either 
fairly great or considerably smaller.? 
In six sessions, the inspection patterns 
were given repeatedly on six successive 
days. The authors assumed that no 
accumulation of satiation would occur 
from one day to the next. But since 
each inspection period lasted five min., 
this expectation does not agree with our 
own experience in this field. The as- 
sumption is also contradicted by their 
results which seem to follow directly from 
our present argument. As experiments 
were continued, the weaker figures gave, 
on the whole, progressively better results. 
It seems particularly important, how- 
ever, that just the opposite happened 
when the brighter figures were used. 
While in this case effects had been par- 
ticularly marked in the beginning, results 

* We do not see clearly why the authors ex- 
pected to find stronger after-effects when the 
brightness difference was smaller. K. and W. 
have shown that so long as satiation is not too 
strong the opposite actually happens, as it must 
for very simple reasons (8, p. 330). 
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became less conspicuous as experiments 
were repeated. We are at present con- 
cerned only with this finding. It agrees 
entirely with our thesis that obstacles to 
figural after-effects must develop when 
long periods of intense satiation are given 
too often, and when therefore the ‘wrong’ 
places are also considerably affected. 
No ‘recovery’ occurred from one day to the 
next, obviously because in these experi- 
ments the satiation of ‘wrong’ areas was 
so strong that 24 hours did not suffice as a 
rest period. If after the six experimental 
days with the bright objects a very long 
rest period had been given, first results 
after this period might have shown an 
improvement. 


Now that both the obstacle to the 
destruction of the illusion and the 
spontaneous disappearance of this 
obstacle have been given an explana- 
tion in terms of satiation, we can re- 
turn to the question why phenomena 
of the same kind are also observed in 
certain forms of learning. We could 
not accept the conclusion that the 
illusion is destroyed by learning. But 
we should be inclined to agree if it were 
now suggested that, on the contrary, 
such phenomena occur during learn- 
ing processes because these processes 
are accompanied by considerable sati- 
ation. There is no reason to believe 
that only perceptual facts in the strict- 
est sense can cause this condition; 
moreover, actual perception always 
plays a réle in learning of the kind to 
which we are referring. It is also a 
characteristic of such learning that 
similar items or activities follow one 
another in many repetitions. ‘Thus, 
conditions for the development of 
satiation are quite favorable. It is 
therefore hardly surprising to find 
that, when these forms of learning are 
investigated, they are accompanied 
by symptoms which clearly resemble 
those of satiation, and also by phe- 
nomena of recovery after rest periods. 
Whatever the nature of memory 
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traces, of associations, and so forth, 
may be in other respects, they must 
generally constitute patterns. If, 
when these patterns develop, patterns 
of satiation are established at the 
same time, consequences are bound to 
be disturbing, particularly since in 
often repeated trials satiation tends 
to spread from more restricted into 
wider areas. It must also be re- 
membered that under the conditions of 
such experiments not only one trace 
and one related satiation pattern but 
a number of patterns of both kinds 
are formed. 


For a while, it was believed that the 
inhibitions observed in such situations 
could be explained by the competition of 
the various recall tendencies which arise 
when many items of a similar kind are 
given. While confusions of this kind 
undoubtedly occur, they are now no 
longer regarded as the only causes of the 
inhibitions. There must be other factors 
which operate in the same direction. 
Future investigations will have to show 
whether satiation is actually one of these 
factors. At any rate, however, on the 
assumption that it is, such facts as the 
favorable influence of rest intervals, 
reminiscence, and the like, could be ex- 
plained without great difficulties. For, 
so long as satiation is not extreme, it 
tends, of course, to weaken within a 
relatively short time. 


In the preceding discussion one 
point has not been quite satisfactorily 


explained. We have distinguished 
between memory traces and patterns 
of satiation, and since the former con- 
cept has not been given an interpreta- 
tion in biophysical terms, we have 
not been able to indicate precisely 
how patterns of satiation would 
affect memory traces. We must 
therefore now ask ourselves how 
satiation is related to memory. The 
question obtrudes itself for this 
reason. If patterns of satiation are 
actually formed by currents which ac- 


company pattern vision, then such 
patterns must be quite adequate rep- 
resentations of the processes by which 
they are established. For satiation 
must develop not only in the same 
spatial distribution as is characteris- 
tic of the current in question, but 
intensity relations which obtain 
among the various parts of the current 
must also be repeated in the satiation 
pattern, because the polarizing action 
of a current is at each point directly 
related to the local density of the flow. 
To be sure, when satiation becomes so 
strong that the distribution of the 
current is affected, further satiation 
will no longer follow the original pat- 
tern. But in our present connection 
the case of intense satiation need not 
be considered. We are now interested 
mainly in the patterns of satiation 
which develop when objects or activi- 
ties are given only for very limited 
periods. Under these circumstances, 
we repeat, such patterns must strongly 
resemble the processes by which they 
are established. 

Little is known about memory 
traces. They are defined by certain 
operations, such as recognition, recall, 
and so forth, which they are supposed 
to explain. But if they are to serve 
this function, we must ascribe to them 
at least one fundamental characteris- 
tic: to a considerable degree, memory 
traces must resemble the processes by 
which they are established. It is 
true that sometimes we have reasons 
to suspect that the traces are defective; 
but this very expression points to the 
fact that in many instances the cor- 
respondence must be fairly good. As 
a consequence, the theoretical situ- 
ation in this field is now as follows. 
It is assumed that the brain processes 
which go with psychological events 
establish memory traces, and that 
these traces resemble the processes in 
question. But at the same time these 
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processes form satiation patterns 
which must also be adequate repre- 
sentations of the processes. It seems 
hardly natural to believe that a given 
process establishes simultaneously two 
altogether different effects in the 
nervous system which are both virtu- 
ally pictures of that process. Thus 
the question arises whether the two 
effects, memory traces and satiation 
patterns, can perhaps be identified. 
But memory traces are only most indi- 
rectly defined, while patterns of sati- 
ation have now been given an inter- 
pretation in fairly specific biophysical 
terms. Under these circumstances, 
the two concepts can be profitably 
identified only if the concept which is 
less well understood is reduced to the 
one which is much better defined. 
Hence our question must actually be 
whether satiation patterns can be 
assumed to play the part which is 
commonly attributed to memory 
traces. 

We are by no means prepared to 
give a definitive answer. Such an 
answer will be possible only after 
many investigations both in the field 
of memory and in that of satiation. 
Even at the present time, however, 
the question deserves serious attention 
because of certain observations which 
we will now describe. 

If it is satiation which makes the 
M.-L. illusion decrease or disappear, 
then our experiments prove that sati- 
ation is a far more persistent condition 
than we had previously suspected. 
The first observation which showed 
that a figural after-effect may be 
demonstrated after many hours was 
made by Gibson (2). This observa- 
tion was confirmed by Kohler and 
Wallach (8) who found that, when 
many satiation periods have been 
given, resulting after-effects may sur- 
vive for more than a day. On the 
premise that the M.-L. illusion is 
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destroyed by satiation, all instances 
in which it was destroyed by Judd, 
Seashore (11) and Lewis prove the 
same fact. For, in their work, the 
development was completed only when 
experimentation had been continued 
for several days, if not for wecks. 
This implies that the satiation estab- 
lished in one session did not disappear 
from one day to the next. We know, 
however, that, when sometimes rest 
periods of several days were given, the 
illusion tended to decrease more 
rapidly. Consequently, satiation in 
the ‘right’ places survived such longer 
periods also. In our investigation, 
we made similar observations. Rest 
periods of 48 hours were often particu- 
larly effective. Even when an S had 
not yet done many experiments, one 
day’s trials often clearly influenced 
results two days later. This was true 
also of Subject F, who was given 
mainly periods of fixation rather than 
comparison trials, and whose experi- 
mental sessions were separated by 
rest intervals of 48 hours. 

Gradually, we began to suspect 
that, once a strong effect is established, 
it may survive for much longer periods 
than two days. Thus, most of our 
Ss were given delayed tests when 
regular sessions had been completed. 
Results were as follows: 


With Subject A, experiments were 
interrupted for two months. After this 
period, we found a negative illusion of 
the same size as had been measured be- 
fore the interruption. Four months 
after this test, the negative illusion was 
still present, although now it had slightly 
decreased. 

With Subject B, tests given one month 
after the last regular trials still showed a 
considerable negative illusion. Four 
months later, no negative illusion was 
found, but there was still no positive 
illusion. 

With Subject D, the illusion had just 
disappeared when regular experimenta- 








406 


tion was discontinued. After an interval 
of four months, the illusion was still 
absent. 

With Subject E, we interrupted experi- 
ments when the illusion had been com- 
pletely destroyed. A first delayed test 
was given three weeks later, and a second 
four months after the first. No reliable 
deviation from objective equality of the 
crucial distances was found in these tests. 

With Subject F, regular experiments 
were discontinued when a negative illu- 
sion had been established. Three weeks 
later, the negative illusion had disap- 
peared, but there was also no positive 
illusion, 


It will be remembered that, when 
the illusion had been destroyed, our 
Ss were also tested with patterns of 
short vertical lines which passed 
through the points where the corners 
of the angles had been located in the 
illusion pattern. The distances be- 
tween these lines were found to differ 
in the expected direction. Measure- 
ments of this kind were also done after 
long delays. Particular care was 
taken to prevent delayed measure- 
ments with the illusion pattern from 
affecting the tests with the new pat- 
tern. The delay periods mentioned 
in the following are, in each case, 
intervals between the last preceding 
trials with the illusion figure and the 
time of the measurements with the 
new pattern. We found these results: 


Subject A. When first measured, the 


distortion amounted to 12 percent. 
Four months after the last tests with the 
M.-L. figure, it still had the same size. 

Subject B. One week after the last 
preceding trials with the illusion pattern, 
the distortion was 11 percent. Four 
months later, its size was 10 percent. 

Subject C. The distortion was meas- 
ured only once, namely, after a delay of 
one month. It then amounted to 6 
percent. 

Subject D. First measurements gave 
the value 14 percent. The same value 
was found when, five and a half months 
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after the last tests with the M.-L. pat- 
tern, the measurement was repeated. 

Subject E. When first measured, the 
distortion was 6 percent. It was still 
6 percent four months after the last ex- 
periments with the illusion figure. 

Subject F. First measurements gave 
the value 12 percent. Three weeks 
later, we found 6 percent. 


Results of this kind will perhaps not be 
regarded as conclusive so long as tests 
remain restricted to the special instance 
of the destruction of an illusion pattern, 
and as therefore doubts may be left 
whether it is actually satiation which 
here persists for very long periods. But 
since these experiments were done, simi- 
lar results have been obtained by Kéhler 
and Dinnerstein, who investigated kin- 
esthetic rather than visual after-effects 
(6). Gibson had shown that his curved- 
line effect can also be demonstrated in 
kinesthesis (2). K. and D. found that, 
when Ss have for some time held objects 
of a certain width between the fingers of 
one hand, test objects which are wider 
afterwards appear too wide, while objects 
which are narrower now appear too nar- 
row. The direction of such after-effects 
is the same as that found with compa- 
rable arrangements in vision. When 
strong distortions of this kind had been 
established in a series of widely spaced 
satiation periods, two Ss were given final 
tests afterlong delays. At the end of the 
regular trials, in which the width three 
in. had been used for the satiation of one 
hand, an object with the actual width 
four in. held in the same hand appeared 
in both cases equal to an object with the 
width five in. in the other hand. When 
one of the Ss was tested after three and a 
half months, the distortion had decreased 
only from one to .8 in. With the other 
S, who was tested after five months, the 
size of the effect was now .7 in. Re- 
cently we were asked by a psychologist 
whether such observations could be 
reconciled with plausible theoretical ex- 
pectations. He was willing, fortunately, 
to serve as a subject. With a slightly 
different arrangement, an after-effect was 
soon established which made the width 
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1.§ in. in one hand appear equal to 2.9 
in. in the other hand. After a first delay 
of one and a half months without trials, 
the distortion of 1.4 in. was reduced to 
1.2 in. Four months later, it still 
amounted to 1.0 in. It goes without 
saying that in this case, as in all our de- 
layed tests, no satiation period was given 
before measurements. 

The arrangements used in these kin- 
esthetic experiments involved no illu- 
sions. The distortions which proved to 
be so persistent were therefore undoubt- 
edly caused by satiation. Naturally, 
similar tests should also be performed 
with visual satiation patterns which are 
in no sense ‘abnormal.’ But in view of 
our results with kinesthetic after-effects 
we see no reason why our tests with the 
M.-L. figure should not be accepted as 
proof that visual satiation may persist 
for very long periods.’ 

Quite apart from more particular 
arguments, the idea that memory of 
patterns is mediated by the polarizing 
action of currents in the brain appears 
plausible enough. In fact, one of us 
discussed this possibility before the 
concept of figural after-effects in its 
present sense was known, i.e., before 
polarization could be made responsible 
for the formation of satiation patterns 
(5, Chap. 7). Now that the theory 
of satiation is fairly well developed, it 
proves to be virtually identical with 
the theory previously suggested in the 
case of pattern memory. This is, of 
course, merely a consequence of the 
fact to which we referred in an earlier 
paragraph: memory traces of patterns 
and moderate satiation effects of these 
patterns must have the same struc- 
tures, since both are formed simultane- 
ously by the same events, and must 


* So far only one experiment is available which 
demonstrates great persistence in the case of a 
‘normal’ visual after-effect. Mr. W. Lichten 
(cf. 7, footnote) measured the after-effect of his 
S five months after the last satiation period had 
been given. At the time, the size of the effect 
was still 73 percent of its size before the delay. 
The pattern which he used involved no illusion. 
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both resemble these events. We 
found it hard to accept this dualism 
under any circumstances. Surely, a 
strictly dualistic view is almost in- 
compatible with the evidence which 
has now been presented. For, ac- 
cording to this evidence satiation pat- 
terns may be persistent to a degree 
which makes them comparable to 
memory traces in this respect also. 
The longest delay periods after which 
tests were made amounted to about 
half a year. Since at the time sati- 
ation effects were not only present but 
quite marked, we have no reason to 
suppose that they would have entirely 
disappeared within a year. To this 
we must add that so far delayed tests 
have been given only when no new 
satiation periods preceded these tests. 
In the future, the same problem must, 
of course, also be examined in the way 
which is so often used in memory ex- 
periments. Even when recall as such 
is no longer possible, considerable 
retention may be demonstrated by 
showing that relearning is still easier 
than original learning has been. Sim- 
ilarly, the persistence of satiation 
patterns ought to be subjected to 
more sensitive tests by measuring the 
amount of new satiation which re- 
establishes an after-effect of pre- 
scribed size after a very long interval. 

Identification of memory traces 
with satiation patterns would also 
remove a difficulty which we men- 
tioned in the beginning of this dis- 
cussion. We had found it plausible 
to assume that the inhibitions en- 
countered in certain forms of learning 
are, at least in part, of the same kind 
as the obstacles which tend to prevent 
the formation of sharply defined 
satiation patterns. But at the time 
we were unable to say precisely how 
memory traces would be affected by 
satiation patterns, if both developed 
simultaneously in a given learning 
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process. On the hypothesis that 
memory traces are themselves sati- 
ation patterns, this question could 
readily be answered. Satiation does 
not remain restricted to its original 
pattern when inspection periods are 
too long or too frequent. The fact, 
we have seen, follows without any 
additional assumption from the na- 
ture of satiation as such. Thus, if 
memory traces were satiation pat- 
terns, the mere accumulation of many 
similar traces in a given learning proc- 
ess would naturally distort the pat- 
terns in question; in other words, the 
inhibitions which tend to accompany 
the formation of such traces would 
follow from the same principle as 
would be used in explaining the traces 
themselves. We need not, of course, 
repeat what has been said before 
about reminiscence and related phe- 
nomena. Our explanation of these 
facts would not be altered by the 
assumption that memory traces are 
satiation patterns. 


So far we have considered only argu- 
ments which make this assumption ap- 
pear plausible. Actually, however, we 
are just as strongly impressed by ques- 
tions which must here be raised, pre- 
cisely because they cannot yet be clearly 
answered. In the first place, if ordinary 
memory traces were satiation patterns, 
they would have to be weak editions of 
such patterns; for we have no evidence 
that common memory traces are capable 
of causing figural after-effects. More- 
over, memory traces are generally formed 
within very short periods, during which 
no higher degrees of satiation can be 
established. As a matter of principle, 
it is, of course, perfectly sound to refer to 
satiation patterns which are too weak to 
cause figural after-effects. In fact, K. 
and W. have indirectly proved that such 
patterns exist (8, p. 354). But it has 
not been proved that the persistence of 
such weak patterns is comparable to 
that of memory traces, and at the present 
time we see no way in which such a proof 
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could be given. No conclusion in this 
respect can be drawn from tests such as 
ours, in which satiation patterns are 
shown to be extremely persistent after 
long and repeated inspection periods. 

A second problem may be more readily 
accessible to experimentation. Many 
memory traces, such as those of words, 
rhythms, melodies and actions, are pat- 
terns in a temporal sense. The achieve- 
ments which are made possible by mem- 
ory of this kind can be understood only 
if the sequence in which facts have fol- 
lowed one another persists in the traces. 
The theory of satiation, on the other 
hand, has so far been formulated as 
though successively established satiation 
effects formed a compound pattern in 
which their succession is no longer repre- 
sented. Actually, we do not wish to 
defend this formulation; it has been used 
only because in this fashion first theoreti- 
cal discussions could be greatly simpli- 
fied. There are facts in the field of 
satiation itself which strongly argue 
against sucha view. In other words, not 
only satiation effects per se but to a degree 
also their sequence must be represented 
in the pattern which is formed when 
satiating events follow one another. In 
this respect, therefore, a more complete 
theory of satiation is bound to make the 
same assumptions as can be derived from 
the temporal characteristics of certain 
memory patterns. We should like to 
add that it would by no means be 
astonishing if the temporal order of 
polarizing events were represented in 
the electrotonic patterns which they 
establish. 

A third problem refers to the way in 
which memory traces are reactivated. 
Even if it were taken for granted that 
memory traces are satiation patterns, 
this problem would remain unsolved. 
For at the present time the operational 
definition of such patterns is given in 
terms of figural after-effects, and the role 
which test patterns play in these effects 
appears to be quite different from that of 
objects which are recognized, or cause 
recall. To mention only the greatest 
difference, figural after-effects are fairly 
well localized, while recognition and re- 
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call are virtually independent of the 
location in which corresponding memory 
traces have been formed. It seems to 
follow that, if memory traces were sati- 
ation patterns, we could not hope to 
clarify their reactivation on this ground, 
so long as figural after-effects are our only 
means of studying such patterns. Obvi- 
ously, the nature of memory traces is only 
yne problem in the field of memory, and 
their behavior in actual recognition or 
recall is an equally important second 
problem. We should like to emphasize 
that only the former problem has been 
discussed in the preceding paragraphs. 


In concluding, it will be advisable 
to return once more to the destruction 
of the M.-L. illusion in repeated trials. 
This effect, we have maintained, is 
caused by satiation rather than by 
learning. But we have also con- 
sidered the possibility that memory 
traces and satiation patterns are facts 
of essentially the same kind. If this 


thesis should prove to be correct, 


would it not follow that in this fashion 
the ‘practice’ theory which we claim 
to have refuted is after all reestab- 
lished? 

Actually, such a conclusion would 
be mistaken. It would be based on a 
certain ambiguity of such terms as 
‘practice’ and ‘learning.’ Those who 
believed that the M.-L. illusion is de- 
stroyed by learning used this concept 
in a quite specific sense. According 
to them the illusion decreases, and is 
finally destroyed, because Ss learn to 
assume a particular analytical attitude 
which they then apply with ever- 
growing ease. Our arguments against 
this view remain in force quite inde- 
pendently of any hypothesis which 
refers to the nature of memory traces. 
To be sure, if memory traces should 
prove to be satiation patterns, our 
explanation, according to which the 
illusion is destroyed by satiation, 
could perhaps be said to be at the same 
time an explanation in terms of mem- 


ory traces. For, on this assumption, 
it would be an accumulation of traces 
in particular areas of the M.-L. figure 
which gradually destroys the illusion. 
We very much doubt, however, 
whether a process of this kind should 
be called learning, if the same term is 
also used when Ss develop new proce- 
dures or skills. At any rate, our 
theory would still remain quite differ- 
ent from the ‘practice’ theory. For 
it would be our thesis that the illusion 
disappears when traces of the M.-L. 
pattern as such accumulate, while the 
‘practice’ theory postulates a training 
which refers to particular activities of 
the subject. 


SUMMARY AND CONCLUSIONS 


1. The destruction of the Miiller- 
Lyer illusion in repeated trials is 
impeded by long series of experiments, 
and accelerated by long rest periods. 
These facts follow from the theory of 
satiation. Similar phenomena in the 
field of memory may partly also be 
caused by satiation. 

2. Figural after-effects may persist 
for large parts of a year. It seems 
possible that memory traces are weak 
patterns of satiation; but at the 
present time no convincing proof of 
this thesis can be given. 


(Manuscript received for immediate 
publication January 25, 1950) 
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